FIELD OF THE INVENTION

The present invention concerns novel nucleic acid sequences, vectors and
host cells containing them, amino acid sequences encoded by said sequences, and
antibodies reactive with said amino acid sequences, as well as pharmaceutical
compositions comprising any of the above. The present invention further concerns
methods for screening for candidate activators or deactivators utilizing said amino
acid sequences.

BACKGROUND OF THE INVENTION 

Alternative splicing (AS) is an important regulatory mechanism in higher
eukaryotes (P.A. Sharp, Cell 77, 805-815 (1994)). It is thought to be one of the
most important mechanisms for differential expression related to tissue or
developmental stage specificity. It is known to play a major role in numerous
biological systems, including human antibody responses and sex determination
in Drosophila (S. Stamm, M.Q. Zhang, T.G. Marr and D.M. Helfman, Nucleic
Acids Research 22, 1515-1526 (1994); B. Chabot, Trends Genet. 12, 472-478
(1996); R.E. Breitbart, A. Andreadis, B. Nadal-Ginard, Annu. Rev. Biochem.
56, 467-495 (1987); C.W. Smith, J.G. Patton, B. Nadal-Ginard, Annu. Rev.
Genet. 27, 527-577 (1989)).

Until recently it was commonly believed that alternative splicing existed in
only a small fraction of genes (about 5%). A recent observation based on a
literature survey of known genes revises this conservative estimate upward, to
at least 30% of human genes being alternatively spliced (M.S. Gelfand,
I. Dubchak, I. Draluk and M. Zorn, Nucleic Acids Research 27, 301-302
(1999)). The importance of the actual frequency of this phenomenon lies not only
in the direct impact on the number of proteins created (100,000 human genes, for
example, would be translated to a much higher number of proteins), but also in
the diversity of functionality derived from the process.
Several mechanisms acting at different stages may be held responsible for the
complexity of higher eukaryotes. Those characterized to date include alternative
splicing at the transcription level, RNA editing at the post-transcriptional
level, and post-translational modifications.


GLOSSARY 

In the following description and claims use will be made, at times, of a
variety of terms, and the meaning of such terms as they should be construed in
accordance with the invention is as follows:

"Variant nucleic acid sequence" - the sequence shown in any one of SEQ ID
NO: 1 to SEQ ID NO: 17, sequences having at least 90% identity (see below) to
said sequence, and fragments (see below) of the above sequences at least 20 b.p.
long. These sequences code for novel, naturally occurring alternative splice
variants of native and known genes. It should be emphasized that
the novel variants of the present invention are naturally occurring sequences
resulting from alternative splicing of genes and not merely truncated, mutated or
fragmented forms of known sequences which are artificially produced.

"Variant product" - also referred to at times as the "variant protein" or "variant
polypeptide" - is an amino acid sequence encoded by the variant nucleic acid
sequence, which is a naturally occurring mRNA sequence obtained as a result of
alternative splicing. The amino acid sequence may be a peptide or a protein, as well
as a peptide or protein having chemically modified amino acids (see below), such as
a glycopeptide or glycoprotein. The variant products are shown in any one of SEQ
ID NO: 18 to SEQ ID NO: 34. The term also includes homologues (see below) of
said sequences in which one or more amino acids has been added, deleted,
substituted (see below) or chemically modified (see below), as well as fragments
(see below) of this sequence having at least 10 amino acids.

"Nucleic acid sequence" - a sequence composed of DNA nucleotides, RNA
nucleotides or a combination of both types, which may include natural nucleotides,
chemically modified nucleotides and synthetic nucleotides.

"Amino acid sequence" - a sequence composed of any one of the 20 naturally
occurring amino acids, amino acids which have been chemically modified (see
below), or composed of synthetic amino acids.

"Fragment of variant nucleic acid sequence" - a novel short stretch of nucleic
acid sequence of at least 20 b.p., which does not appear as a continuous stretch
in the original nucleic acid sequence (see below). The fragment may be a
sequence which was previously undescribed in the context of the published RNA
and which affects the amino acid sequence encoded by the known gene. For
example, where the variant nucleic acid sequence includes a sequence which was
not included in the original sequence (for example a sequence which was an
intron in the original sequence), the fragment may contain said additional
sequence. The fragment may also be a region which is not an intron and which
was not present in the original sequence, for example where the variant lacks a
non-terminal region which was present in the original sequence. The two
stretches of nucleotides spanning this region (upstream and downstream) are
brought together by splicing in the variant, but are spaced from each other by
the spliced-out region in the original sequence and are thus not continuous in
the original sequence. A continuous stretch of nucleic acids comprising said two
spanning stretches of nucleotides is not present in the original sequence and
thus falls under the definition of fragment.

"Fragments of variant products" - novel amino acid sequences coded by the 
"fragment of variant nucleic acid sequence" defined above.
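
The junction case of the fragment definition above can be illustrated with a minimal sketch. It assumes toy sequences invented purely for illustration (they are not any of SEQ ID NO: 1 to 17), and the helper name junction_fragment and its flank parameter are hypothetical. The sketch shows that a stretch spanning the point where the upstream and downstream regions are joined in the variant is a continuous stretch of the variant but not of the original sequence.

```python
def junction_fragment(upstream: str, downstream: str, flank: int = 10) -> str:
    """Return a stretch spanning the splice junction of the variant.

    Takes the last `flank` nucleotides of the upstream region and the first
    `flank` nucleotides of the downstream region (hypothetical helper).
    """
    return upstream[-flank:] + downstream[:flank]


# Toy sequences, invented for illustration only.
upstream = "ATGGCCAAGCTTGGATCCGA"
spliced_out = "GTAAGTCCCTTTAAACCCAG"   # present in the original, absent in the variant
downstream = "CTGCAGGAATTCTGATAGCA"

original = upstream + spliced_out + downstream
variant = upstream + downstream

fragment = junction_fragment(upstream, downstream, flank=10)

# The 20-nucleotide fragment is continuous in the variant only.
print(len(fragment), fragment in variant, fragment in original)
```

With a flank of 10 on each side the fragment meets the 20 b.p. minimum stated in the definition; a shorter flank would not.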
"Homologues of variants" - amino acid sequences of variants in which one or
more amino acids has been added, deleted or replaced. The addition, deletion or
replacement should be in the regions or adjacent to regions where the variant
differs from the original sequence (see below).

"Conservative substitution" - refers to the substitution of an amino acid in one
class by an amino acid of the same class, where a class is defined by common
physicochemical amino acid side chain properties and high substitution
frequencies in homologous proteins found in nature, as determined, for example,
by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. Six
general classes of amino acid side chains have been categorized and include:
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gln, Glu);
Class IV (His, Arg, Lys); Class V (Ile, Leu, Val, Met); and Class VI (Phe, Tyr,
Trp). For example, substitution of an Asp for another class III residue such as
Asn, Gln, or Glu, is a conservative substitution.

"Non-conservative substitution" - refers to the substitution of an amino acid in
one class with an amino acid from another class; for example, substitution of an
Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gln.
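
The six side-chain classes listed above lend themselves to a simple lookup. The following is a minimal sketch, assuming three-letter residue codes; the names SIDE_CHAIN_CLASSES, residue_class and is_conservative are hypothetical and introduced only for illustration.

```python
# The six side-chain classes as listed in the definition above.
SIDE_CHAIN_CLASSES = {
    "I": {"Cys"},
    "II": {"Ser", "Thr", "Pro", "Ala", "Gly"},
    "III": {"Asn", "Asp", "Gln", "Glu"},
    "IV": {"His", "Arg", "Lys"},
    "V": {"Ile", "Leu", "Val", "Met"},
    "VI": {"Phe", "Tyr", "Trp"},
}


def residue_class(residue: str) -> str:
    """Return the class label (I to VI) of a three-letter residue code."""
    for label, members in SIDE_CHAIN_CLASSES.items():
        if residue in members:
            return label
    raise ValueError(f"unknown residue: {residue}")


def is_conservative(old: str, new: str) -> bool:
    """A substitution is conservative when both residues share a class."""
    return residue_class(old) == residue_class(new)


print(is_conservative("Asp", "Asn"))  # both class III
print(is_conservative("Ala", "Asp"))  # class II versus class III
```

The first call reproduces the conservative example given in the text (Asp replaced by Asn), and the second reproduces the non-conservative example (Ala replaced by Asp).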

"Chemically modified" - when referring to the product of the invention, means a
product (protein) where at least one of its amino acid residues is modified either by
natural processes, such as processing or other post-translational modifications, or
by chemical modification techniques which are well known in the art. Among
the numerous known modifications, typical but not exclusive examples include:
acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor
formation, covalent attachment of a lipid or lipid derivative, methylation,
myristoylation, pegylation, prenylation, phosphorylation, ubiquitination, or any
similar process.

"Biologically active" - refers to the variant product having some sort of
biological activity, for example, some physiologically measurable effect on target
cells, molecules or tissues.

"Immunologically active" - defines the capability of a natural, recombinant or
synthetic variant product, or any fragment thereof, to induce a specific immune
response in appropriate animals or cells and to bind with specific antibodies.
Thus, for example, an immunologically active fragment of variant product
denotes a fragment which retains some or all of the immunological properties of
the variant product, e.g. can bind specific anti-variant product antibodies or
can elicit an immune response which will generate such antibodies or cause
proliferation of specific immune cells which produce variant.

"Optimal alignment" - is defined as an alignment giving the highest percent 
identity score. Such alignment can be performed using a variety of commercially
available sequence analysis programs, such as the local alignment program
LALIGN using a ktup of 1, default parameters and the default PAM. A preferred
alignment is the one performed using the CLUSTAL-W program from
MacVector (TM), operated with an open gap penalty of 10.0, an extended gap
penalty of 0.1, and a BLOSUM similarity matrix. If a gap needs to be inserted
into a first sequence to optimally align it with a second sequence, the percent
identity is calculated using only the residues that are paired with a corresponding
amino acid residue (i.e., the calculation does not consider residues in the second
sequence that are in the "gap" of the first sequence). In the case of alignments of
known gene sequences with that of the new variant, the optimal alignment
invariably includes aligning the identical parts of both sequences together, then
keeping apart and unaligned the sections of the sequences that differ one from the
other.
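
The gap rule above (only paired residues are counted) can be sketched as follows. This is a minimal illustration that assumes the two sequences have already been aligned by an external program and are supplied as equal-length strings with "-" marking gap positions; the function name percent_identity and the gap symbol are assumptions, not part of any of the alignment programs named above.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over paired positions of two pre-aligned sequences.

    Columns in which either sequence has a gap are skipped, so residues
    facing a gap do not enter the calculation.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    paired = [(a, b) for a, b in zip(aligned_a, aligned_b)
              if a != gap and b != gap]
    if not paired:
        return 0.0
    matches = sum(1 for a, b in paired if a == b)
    return 100.0 * matches / len(paired)


# Nine of ten paired residues are identical.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
# The residue facing the gap is ignored; all four paired residues match.
print(percent_identity("AC-DE", "ACKDE"))
```

A threshold such as the 90% identity figure used in this specification can then be applied to the returned value.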
"Having at least 90% identity" - with respect to two amino acid or nucleic acid
sequences, refers to the percentage of residues that are identical in the
two sequences when the sequences are optimally aligned. Thus, 90% amino acid
sequence identity means that 90% of the amino acids in two or more optimally
aligned polypeptide sequences are identical. However, this definition explicitly
excludes sequences which are 100% identical with the original sequence from
which the variant of the invention was varied.

"Isolated nucleic acid molecule having a variant nucleic acid sequence" - is a
nucleic acid molecule that includes the coding variant nucleic acid sequence. Said
isolated nucleic acid molecule may include the variant nucleic acid sequence as
an independent insert; may include the variant nucleic acid sequence fused to
additional coding sequences, encoding together a fusion protein in which the
variant coding sequence is the dominant coding sequence (for example, the
additional coding sequence may code for a signal peptide); may include the
variant nucleic acid sequence in combination with non-coding sequences, e.g.,
introns or control elements, such as promoter and terminator elements or 5'
and/or 3' untranslated regions, effective for expression of the coding sequence
in a suitable host; or may be a vector in which the variant protein coding
sequence is heterologous.

"Expression vector" - refers to vectors that have the ability to incorporate and 
express heterologous DNA fragments in a foreign cell. Many prokaryotic and 
eukaryotic expression vectors are known and/or commercially available. 
Selection of appropriate expression vectors is within the knowledge of those
having skill in the art.

"Deletion" - is a change in either nucleotide or amino acid sequence in which
one or more nucleotides or amino acid residues, respectively, are absent.



"Insertion" or "addition" - is that change in a nucleotide or amino acid
sequence which has resulted in the addition of one or more nucleotides or amino 
acid residues, respectively, as compared to the naturally occurring sequence. 

"Substitution" - replacement of one or more nucleotides or amino acids by
different nucleotides or amino acids, respectively. As regards amino acid
sequences the substitution may be conservative or non-conservative.

"Antibody" - refers to IgG, IgM, IgD, IgA, or IgG antibody. The definition 
includes polyclonal antibodies or monoclonal antibodies. This term refers to
whole antibodies or fragments of the antibodies comprising the antigen-binding 
domain of the anti-variant product antibodies, e.g. antibodies without the Fc 
portion, single chain antibodies, fragments consisting of essentially only the 
variable, antigen-binding domain of the antibody, etc. 


"Distinguishing antibody" - an antibody capable of binding to the variant product
and not to the original amino acid sequence from which it has been varied, or an
antibody capable of binding to the original amino acid sequence and not to the
variant product.

"Activator" - as used herein, refers to a molecule which mimics the effect of the 
natural variant product or at times even increases or prolongs the duration of the
biological activity of said product, as compared to that induced by the natural
product. The mechanism may be any mechanism known for prolonging the
activities of biological molecules, such as binding to receptors; prolonging the
lifetime of the molecules; increasing the activity of the molecules on their target;
increasing the affinity of the molecules for their receptor; inhibiting degradation or
proteolysis of the molecules; or mimicking the biological activity of the variants
on their targets, etc. Activators may be polypeptides, nucleic acids,
carbohydrates, lipids, or derivatives thereof, or any other molecules which can
bind to and activate the variant product.

"Deactivator" or ("Inhibitor") - refers to a molecule which modulates the 
activity of the variant product in an opposite manner to that of the activator, by
decreasing or shortening the duration of the biological activity of the variant
product. This may be done by any mechanism known to deactivate or inhibit
biological molecules, such as blocking of the receptor, blocking of the active site,
competition for the binding site in the target, enhancement of degradation, etc.
Deactivators may be polypeptides, nucleic acids, carbohydrates, lipids, or
derivatives thereof, or any other molecules which bind to and modulate the
activity of said product.

"Treating a disease" - refers to administering a therapeutic substance effective 
to ameliorate symptoms associated with a disease, to lessen the severity or cure
the disease, or to prevent the disease from occurring. 

"Detection" - refers to a method of detection of a disease, disorder, pathological 
or normal condition. This term may refer to detection of a predisposition to a
disease as well as to establishing the prognosis of the patient by determining the
severity of the disease. 

"Probe" - the variant nucleic acid sequence, or a sequence complementary 
therewith, when used to detect the presence of other similar sequences in a sample.
The detection is carried out by identification of hybridization complexes between
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label. 

"Original sequence" - the amino acid or nucleic acid sequence from which the 
variant of the invention has been varied as a result of alternative splicing.
SUMMARY OF THE INVENTION

The present invention is based on the finding of several novel, naturally 
occurring splice variants, which are naturally occurring sequences obtained by 
alternative splicing of known genes. The novel splice variants of the invention are 
not merely truncated forms, fragments or mutations of known genes, but rather
novel sequences which naturally occur within the body of individuals. 

The term "alternative splicing" in the context of the present invention and
claims refers to: intron inclusion, exon exclusion, addition or deletion of terminal
sequences in the variant as compared to the original sequences, as well as to the
possibility of "intron retention". Intron retention is an intermediate stage in the
processing of RNA transcripts, where prior to production of fully processed mRNA
the intron (naturally spliced out in the original sequence) is retained in the variant.
These intermediately processed RNAs may have physiological significance and are
also within the scope of the invention.

The novel variant products of the invention may have the same
physiological activity as the original peptide from which they have been varied
(although perhaps at a different level); may have an opposite physiological activity
from the activity featured by the original peptide from which they are varied; may
have a completely different, unrelated activity to the activity of the original from
which they are varied; or alternatively may have no activity at all and this may lead
to various diseases or pathological conditions.

The novel variants may also serve for detection purposes, i.e. their presence
or level may be indicative of a disease, disorder, pathological or normal condition.
Alternatively, the ratio between the level of the variants and the level of the
original peptide from which they were varied, or the ratio to other variants, may
be indicative of a disease, disorder, pathological or normal condition.

For example, for detection purposes, it is possible to establish differential
expression of various variants in various tissues. A certain variant may be
expressed mainly in one tissue, while the original sequence from which it has been
varied, or another variant, may be expressed mainly in another tissue.
Understanding of the distribution of the variants in various tissues may be helpful
in basic research, for understanding the physiological function of the genes, and
may also help in targeting or developing pharmaceuticals.

The study of the variants may also be helpful to distinguish various stages in
the life cycles of the same type of cells, which may also be helpful for development
of pharmaceuticals for various pathological conditions in which the cell cycle is
abnormal, notably cancer.

Thus the detection may be by determination of the presence or the level of
expression of the variant within a specific cell population, comparing said presence
or level between various cell types in a tissue, between different tissues and
between individuals.

Thus the present invention provides by its first aspect a novel isolated
nucleic acid molecule comprising or consisting of any one of the coding sequences
SEQ ID NO: 1 to SEQ ID NO: 17, fragments of said coding sequence having at
least 20 nucleotides (provided that said fragments are continuous stretches of
nucleotides not present in the original sequence from which the variant was varied),
or a molecule comprising a sequence having at least 90% identity to SEQ ID
NO: 1 to SEQ ID NO: 17, provided that the molecule is not completely identical to
the original sequence from which the variant was varied.

The present invention further provides a protein or polypeptide comprising
or consisting of an amino acid sequence encoded by any of the above nucleic acid
sequences, termed herein "variant product", for example, an amino acid sequence
having the sequence as depicted in any one of SEQ ID NO: 18 to SEQ ID NO: 34,
fragments of the above amino acid sequence having a length of at least 10 amino
acids coded by the above fragments of the nucleic acid sequences, as well as
homologues of the above amino acid sequences in which one or more of the amino
acid residues has been substituted (by conservative or non-conservative
substitution), added, deleted, or chemically modified.

The deletions, insertions and modifications should be in regions, or adjacent
to regions, wherein the variant differs from the original sequence.
For example, where the variant is different from the original sequence by
addition of a short stretch of 10 amino acids, in the terminal or non-terminal
portion of the peptide, the invention also concerns homologues of that variant
where the additional short stretch is altered, for example, it includes only 8
additional amino acids, includes 13 additional amino acids, or includes 10
additional amino acids, some of which are conservative or
non-conservative substitutes of the original additional 10 amino acids of the novel
variants. In all cases the changes in the homologue, as compared to the original
sequence, are in the same regions where the variant differs from the original
sequence, or in regions adjacent to said region.

Another example is where the variant lacks a non-terminal region (for
example of 20 amino acids) which is present in the original sequence (due for
example to exon exclusion). The homologues may lack in the same region only 17
amino acids or 23 amino acids. Again the deletion is in the same region where the
variant lacks a sequence as compared to the original sequence, or in a region
adjacent thereto.

It should be appreciated that once the attention of a man versed in the art is
directed to the importance of a specific region, due to the fact that this region
differs in the variant as compared to the original sequence, there is no problem in
altering said specific region by adding to it, deleting from it, or substituting
some amino acids in it. Thus homologues of variants which are derived from the
variant by changes (deletion, addition, substitution) only in said region as well as in
regions adjacent to it are also a part of the present invention. Generally, if the
variant is distinguished from the original sequence by some sort of physiological
activity, then the homologue is distinguished from the original sequence in essentially
the same manner.

The present invention further provides a nucleic acid molecule comprising or
consisting of a sequence which encodes the above amino acid sequences
(including the fragments and homologues of the amino acid sequences). Due to the
degenerate nature of the genetic code, a plurality of alternative nucleic acid
sequences, beyond those depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 17,
can code for the amino acid sequence of the invention. Those alternative nucleic
acid sequences which code for the same amino acid sequences coded by the
sequences SEQ ID NO: 1 to SEQ ID NO: 17 are also an aspect of the present
invention.
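
To give a sense of how many alternative coding sequences the degeneracy of the genetic code allows, the following minimal sketch counts them for a short peptide. It assumes the standard genetic code and one-letter residue codes; the table CODONS_PER_RESIDUE and the function count_coding_sequences are hypothetical names, and the example peptides are invented for illustration.

```python
from math import prod

# Number of sense codons per amino acid in the standard genetic code
# (one-letter codes); the 61 sense codons are distributed as follows.
CODONS_PER_RESIDUE = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2,
    "H": 2, "K": 2, "F": 2, "Y": 2,
    "M": 1, "W": 1,
}


def count_coding_sequences(peptide: str) -> int:
    """Count the distinct nucleotide sequences encoding a peptide."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide)


print(count_coding_sequences("MW"))   # Met and Trp each have a single codon
print(count_coding_sequences("LSR"))  # three six-fold degenerate residues
```

The count grows multiplicatively with peptide length, which is why the alternative coding sequences are described here as a class rather than listed individually.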

The present invention further provides expression vectors and cloning 
vectors comprising any of the above nucleic acid sequences, as well as host cells 
transfected by said vectors. 

The present invention still further provides pharmaceutical compositions
comprising, as an active ingredient, said nucleic acid molecules, said expression
vectors, or said protein or polypeptide.

These pharmaceutical compositions are suitable for the treatment of diseases
and pathological conditions, which can be ameliorated or cured by raising the level 
of any one of the variant products of the invention. 

By a second aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a non-coding sequence which is complementary to that
of any one of SEQ ID NO: 1 to SEQ ID NO: 17, or complementary to a sequence
having at least 90% identity to said sequence (with the proviso added above) or a
fragment of said two sequences (according to the above definition of fragment).
The complementary sequence may be a DNA sequence which hybridizes with any
one of SEQ ID NO: 1 to SEQ ID NO: 17 or hybridizes to a portion of that
sequence having a length sufficient to inhibit the transcription of the
complementary sequence. The complementary sequence may be a DNA sequence
which can be transcribed into an mRNA being an antisense to the mRNA
transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 17 or into an mRNA
which is an antisense to a fragment of the mRNA transcribed from any one of SEQ
ID NO: 1 to SEQ ID NO: 17 which has a length sufficient to hybridize with the
mRNA transcribed from SEQ ID NO: 1 to SEQ ID NO: 17, so as to inhibit its
translation. The complementary sequence may also be the mRNA or the fragment
of the mRNA itself.
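
The notion of a complementary sequence can be illustrated with a minimal sketch that derives the reverse complement of a DNA stretch. The sequences below are toy examples, not any of SEQ ID NO: 1 to 17, and the function names are hypothetical. The exact-match test is an idealization: actual hybridization depends on experimental conditions and tolerates some mismatches, which this sketch does not model.

```python
# Translation table for Watson-Crick base pairing in DNA.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return dna.translate(_COMPLEMENT)[::-1]


def is_exact_complement_of_region(probe: str, target: str) -> bool:
    """True when the probe is perfectly complementary to some region of target.

    Idealized: real hybridization also depends on temperature, salt
    concentration and probe length.
    """
    return reverse_complement(probe) in target


print(reverse_complement("ATGC"))
print(is_exact_complement_of_region("GCAT", "TTATGCAA"))
```

Applying reverse_complement twice returns the input sequence, which is a quick sanity check on the pairing table.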
The nucleic acids of the second aspect of the invention may be used for
therapeutic or diagnostic applications, for example as probes used for the detection
of the variants of the invention. The presence of the variant transcript or the level of
the variant transcript may be indicative of a multitude of diseases, disorders and
various pathological as well as normal conditions. In addition or alternatively, the
level of the transcripts of the variants of the invention may be
compared to that of the transcripts of the original sequences from which they have
been varied, or to the level of transcript of other variants, and said ratio may be
indicative of a multitude of diseases, disorders and various pathological and normal
conditions.

The present invention also provides expression vectors comprising any one 
of the above defined complementary nucleic acid sequences and host cells 
transfected with said nucleic acid sequences or vectors, being complementary to 
those specified in the first aspect of the invention. 

The invention also provides anti-variant product antibodies, namely
antibodies directed against the variant product which specifically bind to said
variant product. Said antibodies are useful both for diagnostic and therapeutic
purposes. For example, said antibody may serve as an active ingredient in a
pharmaceutical composition, as will be explained below.

The present invention also provides pharmaceutical compositions
comprising, as an active ingredient, the nucleic acid molecules which comprise or
consist of said complementary sequences, or a vector comprising said
complementary sequences. The invention also provides
pharmaceutical compositions comprising, as an active ingredient, said anti-variant
product antibodies.

The pharmaceutical compositions comprising said anti-variant product
antibodies or the nucleic acid molecule comprising said complementary sequence
are suitable for the treatment of diseases and pathological conditions where a
therapeutically beneficial effect may be achieved by neutralizing the variant (either
at the transcript or product level) or decreasing the amount of the variant product or
blocking its binding to its target, for example, by the neutralizing effect of the
antibodies, or by the effect of the antisense mRNA in decreasing the expression
level of the variant sequence.

According to the third aspect of the invention, the present invention provides
methods for detecting the level of the transcript (mRNA) of said variant product in
a body fluid sample, or in a specific tissue sample, for example by use of probes
comprising or consisting of said coding sequences; as well as methods for detecting
levels of expression of said product in tissue, e.g. by the use of antibodies capable
of specifically reacting with the variant products of the invention. Detection of the
level of the expression of the variant of the invention, in particular as compared to
that of the original sequence from which it was varied or compared to other variant
sequences all varied from the same original sequence, may be indicative of a
plurality of physiological or pathological conditions.

The method, according to this latter aspect, for detection of a nucleic acid
sequence which encodes the variant product in a biological sample, comprises the
steps of:

(a) providing a probe comprising at least one of the nucleic acid
sequences defined above;

(b) contacting the biological sample with said probe under conditions
allowing hybridization of nucleic acid sequences thereby enabling formation of
hybridization complexes;

(c) detecting hybridization complexes, wherein the presence of the
complexes indicates the presence of nucleic acid sequence encoding the variant
product in the biological sample.

The method as described above is qualitative, i.e. indicates whether the
transcript is present in or absent from the sample. The method can also be
quantitative, by determining the level of hybridization complexes and then
calibrating said levels to determine levels of transcripts of the desired variant in
the sample.
Both qualitative and quantitative determination methods can be used for
diagnostic, prognostic and therapy planning purposes. 

By a preferred embodiment the probe is part of a nucleic acid chip used for
detection purposes, i.e. the probe is a part of an array of probes each present in a
known location on a solid support.

The nucleic acid sequence used in the above method may be a DNA
sequence, an RNA sequence, etc.; it may be a coding sequence or a sequence
complementary thereto (for respective detection of RNA transcripts or
coding-DNA sequences). By quantitation of the level of hybridization complexes
and calibrating the quantified results it is possible also to detect the level of the
transcript in the sample.

Methods for detecting mutations in the region coding for the variant product
are also provided, which may be methods carried out in a binary fashion, namely
merely detecting whether there are any mismatches between the normal variant
nucleic acid sequence of the invention and the one present in the sample, or
carried out by specifically detecting the nature and location of the mutation.

The present invention also concerns a method for detecting variant product
in a biological sample, comprising the steps of:

(a) contacting with said biological sample the antibody of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex,

wherein the presence of said antibody-antigen complex correlates with the
presence of variant product in said biological sample.

Many diseases are diagnosed by detecting the presence of antibodies against
a protein characterizing the disease in the blood, serum or any other body fluid of
the patient. The present invention also concerns a method for detecting anti-variant
antibody in a biological sample, comprising:

(a) contacting said sample with the variant product of the invention,
thereby forming an antibody-antigen complex; and

(b) detecting said antibody-antigen complex,

wherein the presence of said antibody-antigen complex correlates with the
presence of anti-variant antibody in the sample.

As indicated above, both methods (for detection of variant product and for
detection of the anti-variant antibody) can be made quantitative to determine the
level or the amount of the variant or antibody in the sample, alone or in
comparison to the level of the original amino acid sequence from which it was
varied or compared to the level of antibodies against the original amino acid
sequence, and qualitative and quantitative results may be used for diagnostic,
prognostic and therapy planning purposes.

The invention also concerns distinguishing antibodies, i.e. antibodies
capable of binding either to the variant product or to the original sequence from
which the variant has been varied, while not binding to the original sequence or the
variant product respectively. These distinguishing antibodies may be used for
detection purposes.

By yet another aspect the invention also provides a method for identifying 
candidate compounds capable of binding to the variant product and modulating its 
activity (being either activators or deactivators). The method includes: 

(i) providing a protein or polypeptide comprising an amino acid 
sequence substantially as depicted in any one of SEQ ID NO: 18 to 34, or a 
fragment of such a sequence; 

(ii) contacting a candidate compound with said amino acid sequence; 

(iii) measuring the physiological effect of said candidate compound on 
the activity of the amino acid sequence and selecting those compounds which 
show a significant effect on said physiological activity. 

The present invention also concerns compounds identified by the 
methods described above, which compounds may be either activators of the 
variant product or deactivators thereof. 




BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompanying drawings, in which: 
Fig. 1 is a comparison between the amino acid sequence of SEQ ID NO: 18 
and the original sequence from which it has been varied; 

Fig. 2 is a comparison between the amino acid sequence of SEQ ID NO: 19 
and the original sequence from which it has been varied; 

Fig. 3 is a comparison between the amino acid sequence of SEQ ID NO: 20 
and the original sequence from which it has been varied; 

Fig. 4 is a comparison between the amino acid sequence of SEQ ID NO: 21 
and the original sequence from which it has been varied; 

Fig. 5 is a comparison between the amino acid sequence of SEQ ID NO: 22 
and the original sequence from which it has been varied; 

Fig. 6 is a comparison between the amino acid sequence of SEQ ID NO: 23 
and the original sequence from which it has been varied; 

Fig. 7 is a comparison between the amino acid sequence of SEQ ID NO: 24 
and the original sequence from which it has been varied; 

Fig. 8 is a comparison between the amino acid sequence of SEQ ID NO: 25 
and the original sequence from which it has been varied; 

Fig. 9 is a comparison between the amino acid sequence of SEQ ID NO: 26 
and the original sequence from which it has been varied; 

Fig. 10 is a comparison between the amino acid sequence of SEQ ID 
NO: 27 and the original sequence from which it has been varied; 

Fig. 11 is a comparison between the amino acid sequence of SEQ ID 
NO: 28 and the original sequence from which it has been varied; 

Fig. 12 is a comparison between the amino acid sequence of SEQ ID 
NO: 29 and the original sequence from which it has been varied; 

Fig. 13 is a comparison between the amino acid sequence of SEQ ID 
NO: 30 and the original sequence from which it has been varied; 

Fig. 14 is a comparison between the amino acid sequence of SEQ ID 
NO: 31 and the original sequence from which it has been varied; 

Fig. 15 is a comparison between the amino acid sequence of SEQ ID 
NO: 32 and the original sequence from which it has been varied; 

Fig. 16 is a comparison between the amino acid sequence of SEQ ID 
NO: 33 and the original sequence from which it has been varied; 

Fig. 17 is a comparison between the amino acid sequence of SEQ ID 
NO: 34 and the original sequence from which it has been varied. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example I: Comparison of variants with original sequences 

Original sequences were obtained from GenBank Version 110. 
Comparison between the original sequences and the novel variant sequences was 
made using the Pileup application from the GCG suite version 10.0 (January 
1999), with the default values: 

Gap creation penalty (Gap Weight): 8 

Gap extension penalty (GapLength Weight): 2 
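
By way of non-limiting illustration only, the effect of the two gap parameters stated above may be sketched as an affine gap penalty. The sketch assumes the common convention in which a gap of a given length costs the creation penalty plus the extension penalty multiplied by the gap length; other alignment programs count the first gap position differently, so the convention and the function name are assumptions and not a statement of how Pileup is implemented.

```python
GAP_CREATION = 8   # "Gap Weight" value stated in the text
GAP_EXTENSION = 2  # "GapLength Weight" value stated in the text


def gap_penalty(length: int,
                creation: int = GAP_CREATION,
                extension: int = GAP_EXTENSION) -> int:
    """Affine gap cost, assuming cost = creation + extension * length."""
    if length < 0:
        raise ValueError("gap length cannot be negative")
    if length == 0:
        return 0  # no gap, no penalty
    return creation + extension * length
```

Under this assumed convention, one long gap is penalized less than several short gaps of the same total length, which favours alignments that show an alternatively spliced region as a single block.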

The comparison is shown in Figs. 1 to 17, which show the comparison of 
each of the variant products depicted in SEQ ID NO: 18 to 34 with the original 
sequence from which it was varied. 

The following is a table that compares the sequences of the variants of the 
invention to the original sequences from which they were varied and indicates 
where the variant differs from the original sequence. The terminology NV-1 to 
NV-17 corresponds to SEQ ID NO: 1 to SEQ ID NO: 17. 

TABLE 

Accession #    New Variant #    Description of the new variant 

BPI_HUMAN      NV-1             Replacement of the last C-terminal 92 aa of the 
                                protein with alternative 100 aa, which has an 
                                additional potential transmembrane domain. 

BAL_HUMAN      NV-2             Replacement of 121 C-terminal amino acids of the 
                                original protein by alternative 7 amino acids. 

FA12_HUMAN     NV-3             Insertion of 30 amino acids in the catalytic domain of 
                                the original protein. 

TIM1_HUMAN     NV-4             Replacement of 52 C-terminal amino acids of the 
                                original protein by alternative 9 amino acids. 
                                Missing 3 disulfide bonds. 

TIM1_HUMAN     NV-5             Replacement of 94 C-terminal amino acids of the 
                                original protein by alternative 10 amino acids. Has 
                                only 1 out of 5 disulfide bonds; has an extra Cys in the 
                                added 10 amino acids. 

TIM1_HUMAN     NV-6             Replacement of one of the exons of the original 
                                protein by a homologous exon. 

TIM1_HUMAN     NV-7             Deletion of 47 amino acids of the original protein. 
                                Missing 2 disulfide bonds. 

TIM1_HUMAN     NV-8             Deletion of 10 amino acids of the original protein. 
                                Missing one out of 2 glycosylation sites. 

UROK_HUMAN     NV-9             Insertion of 62 amino acids in long A chain. 

PAI1_HUMAN     NV-10            Replacement of 140 C-terminal amino acids of the 
                                original protein, including the active site and 2 out of 
                                3 glycosylation sites, by alternative 12 amino acids. 

PAI1_HUMAN     NV-11            Replacement of 17 C-terminal amino acids of the 
                                original protein by alternative 14 amino acids. 

CTGF_HUMAN     NV-12            Deletion of 32 amino acids, at positions 152-185 of 
                                the original protein. 

DCC_HUMAN      NV-13            Replacement of 1118 C-terminal amino acids of the 
                                original protein, including cytoplasmic domain, 
                                transmembrane region and part of the extracellular 
                                domain, by alternative 12 amino acids. The deleted 
                                region contains all 5 fibronectin type domains and 1 
                                out of the 4 Ig-like C2 type domains. 

DCC_HUMAN      NV-14            Replacement of 28 C-terminal amino acids from the 
                                cytoplasmic domain of the original protein by 
                                alternative 13 amino acids. 

MGR4_HUMAN     NV-15            Deletion of 58 amino acids from the extracellular 
                                domain of the original protein. 

MCR_HUMAN      NV-16            Replacement of 147 C-terminal amino acids, 
                                including part of the steroid binding domain of the 
                                original protein, by alternative 8 amino acids. 

MCR_HUMAN      NV-17            Deletion of 147 C-terminal amino acids, including 
                                part of the steroid binding domain of the original 
                                protein. 

Example II: Designation of the original sequences 

Each novel variant of the invention is varied from an original sequence 
which has a known designation. The designations of the RNA sequences of the 
original sequences from which the variants were varied, and the Accession Numbers 
of the original sequences, are given below. First, information concerning the original 
sequence is given and then the designation of the novel variants of the invention is 
given as NV-1 to NV-17, corresponding to SEQ ID NO: 1 to SEQ ID NO: 17. 

BPI_HUMAN 

BACTERICIDAL PERMEABILITY-INCREASING PROTEIN 

FUNCTION : THE CYTOTOXIC ACTION OF BPI IS LIMITED TO MANY 
SPECIES OF GRAM-NEGATIVE BACTERIA; THIS SPECIFICITY MAY BE 
EXPLAINED BY A STRONG AFFINITY OF THE VERY BASIC 
N-TERMINAL HALF FOR THE NEGATIVELY CHARGED 
LIPOPOLYSACCHARIDES THAT ARE UNIQUE TO THE 
GRAM-NEGATIVE BACTERIAL OUTER ENVELOPE. 
SUBCELLULAR LOCATION : MEMBRANE-ASSOCIATED IN 
POLYMORPHONUCLEAR LEUKOCYTES (PMN) GRANULES. 
TISSUE SPECIFICITY : RESTRICTED TO CELLS OF THE MYELOID 
SERIES. 
DOMAIN : THE N-TERMINAL REGION MAY BE EXPOSED TO THE 
INTERIOR OF THE GRANULE, WHEREAS THE C-TERMINAL PORTION 
MAY BE EMBEDDED IN THE MEMBRANE. DURING PHAGOCYTOSIS 
AND DEGRANULATION, PROTEASES MAY BE RELEASED AND 
ACTIVATED AND CLEAVE BPI AT THE JUNCTION OF THE N- AND 
C-TERMINAL PORTIONS OF THE MOLECULE, PROVIDING 
CONTROLLED RELEASE OF THE N-TERMINAL ANTIBACTERIAL 
FRAGMENT WHEN BACTERIA ARE INGESTED. 

NV-1: 

BAL_HUMAN 

BILE-SALT-ACTIVATED LIPASE (BAL) 
CHOLESTEROL ESTERASE 

FUNCTION : CATALYZES FAT AND VITAMIN ABSORPTION. ACTS IN 
CONCERT WITH PANCREATIC LIPASE AND COLIPASE FOR THE 
COMPLETE DIGESTION OF DIETARY TRIGLYCERIDES. 
CATALYTIC ACTIVITY : TRIACYLGLYCEROL + H(2)O = 
DIACYLGLYCEROL + A FATTY ACID ANION. 
CATALYTIC ACTIVITY : A STERYL ESTER + H(2)O = A STEROL + A 
FATTY ACID. 
ENZYME REGULATION : ACTIVATED BY BILE SALTS CONTAINING A 
7-HYDROXYL GROUP. 
TISSUE SPECIFICITY : MAMMARY GLAND, AND PANCREAS. 
SIMILARITY : BELONGS TO THE TYPE-B CARBOXYLESTERASE/LIPASE 
FAMILY. 

NV-2: 



FA12_HUMAN 
COAGULATION FACTOR XII 

FUNCTION : FACTOR XII IS A SERUM GLYCOPROTEIN THAT 
PARTICIPATES IN THE INITIATION OF BLOOD COAGULATION, 
FIBRINOLYSIS, AND THE GENERATION OF BRADYKININ AND 
ANGIOTENSIN. 
CATALYTIC ACTIVITY : CLEAVES SELECTIVELY ARG-|-ILE BONDS 
AND ACTIVATES COAGULATION FACTORS VII AND XI. 
PTM : O- AND N-GLYCOSYLATED. 
DISEASE : DEFECTS IN F12 DO NOT CAUSE ANY CLINICAL 
SYMPTOMS. THE SOLE EFFECT IS THAT WHOLE-BLOOD CLOTTING 
TIME IS PROLONGED. 
MISCELLANEOUS : FACTOR XII, PREKALLIKREIN, AND HMW 
KININOGEN FORM A COMPLEX BOUND TO AN ANIONIC SURFACE. 
PREKALLIKREIN IS CLEAVED BY FACTOR XII TO FORM KALLIKREIN, 
WHICH THEN CLEAVES FACTOR XII FIRST TO ALPHA-FACTOR XIIA 
AND THEN TO BETA-FACTOR XIIA. ALPHA-FACTOR XIIA ACTIVATES 
FACTOR XI TO FACTOR XIA. 



NV-3: 
TIM1_HUMAN 
METALLOPROTEINASE INHIBITOR 1 

FUNCTION : COMPLEXES WITH METALLOPROTEINASES (SUCH AS 
COLLAGENASES) AND IRREVERSIBLY INACTIVATES THEM. ALSO 
MEDIATES ERYTHROPOIESIS IN VITRO; BUT, UNLIKE IL-3, IT IS 
SPECIES-SPECIFIC, STIMULATING THE GROWTH AND 
DIFFERENTIATION OF ONLY HUMAN AND MURINE ERYTHROID 
PROGENITORS. 
PTM : THE ACTIVITY OF TIMP-1 IS DEPENDENT ON THE PRESENCE OF 
DISULFIDE BONDS. 
SIMILARITY : BELONGS TO THE TIMP FAMILY. 

NV-4: ... 

TIM1_HUMAN 
METALLOPROTEINASE INHIBITOR 1 

NV-5: 

TIM1_HUMAN 
METALLOPROTEINASE INHIBITOR 1 

NV-6: 

TIM1_HUMAN 
METALLOPROTEINASE INHIBITOR 1 

NV-7: 

TIM1_HUMAN 
METALLOPROTEINASE INHIBITOR 1 

NV-8: 

UROK_HUMAN 

UROKINASE-TYPE PLASMINOGEN ACTIVATOR 

FUNCTION : POTENT PLASMINOGEN ACTIVATOR AND IS CLINICALLY 
USED FOR THERAPY OF THROMBOLYTIC DISORDERS. 
CATALYTIC ACTIVITY : SPECIFIC CLEAVAGE OF ARG-|-VAL BOND IN 
PLASMINOGEN TO FORM PLASMIN. 
SUBUNIT : FOUND IN HIGH AND LOW MOLECULAR MASS FORMS. 
EACH CONSISTS OF TWO CHAINS, A AND B. THE HIGH MOLECULAR 
MASS FORM CONTAINS A LONG CHAIN A. CLEAVAGE OCCURS 
AFTER RESIDUE 155 IN THE LOW MOLECULAR MASS FORM TO YIELD 
A SHORT A1 CHAIN. 
PHARMACEUTICAL : AVAILABLE UNDER THE NAME ABBOKINASE 
(ABBOTT). USED IN PULMONARY EMBOLISM (PE) TO INITIATE 
FIBRINOLYSIS. 



NV-9: 



PAI1_HUMAN 

PLASMINOGEN ACTIVATOR INHIBITOR-1, ENDOTHELIAL 

FUNCTION : THIS INHIBITOR ACTS AS "BAIT" FOR TISSUE 
PLASMINOGEN ACTIVATOR, UROKINASE, AND PROTEIN C. ITS RAPID 
INTERACTION WITH TPA MAY FUNCTION AS A MAJOR CONTROL 
POINT IN THE REGULATION OF FIBRINOLYSIS. 
DISEASE : HIGH CONCENTRATIONS OF THIS PROTEIN HAVE BEEN 
ASSOCIATED WITH HUMAN THROMBOEMBOLIC DISEASE. 
MISCELLANEOUS : PAI1 IS INACTIVATED BY PROTEOLYTIC ATTACK 
OF THE UROKINASE-TYPE (U-PA) AND THE TISSUE-TYPE (TPA), 
CLEAVING THE 369(R)-370(M) BOND. 

NV-10: 

PAI1_HUMAN 

PLASMINOGEN ACTIVATOR INHIBITOR-1, ENDOTHELIAL 

NV-11: 



CTGF_HUMAN 

CONNECTIVE TISSUE GROWTH FACTOR 

FUNCTION : MAJOR CONNECTIVE TISSUE MITOATTRACTANT 
SECRETED BY HUMAN VASCULAR ENDOTHELIAL CELLS. THIS 
IMMEDIATE-EARLY PROTEIN MAY BIND ONE OF THE PDGF CELL 
SURFACE RECEPTORS. 
SUBUNIT : MONOMER. 

ALTERNATIVE PRODUCTS: A SHORTER FORM MAY BE PRODUCED 
BY ALTERNATIVE SPLICING OF THE SAME GENE: VARSPLIC 172 
198 MISSING (IN SHORT FORM). 

SIMILARITY : BELONGS TO THE INSULIN-LIKE GROWTH FACTOR 
BINDING PROTEIN FAMILY. CEF-10/CYR61/CTGF/FISP-12/NOV 

NV-12: 



DCC_HUMAN 

TUMOR SUPPRESSOR PROTEIN DCC 

FUNCTION : IMPLICATED AS A TUMOR SUPPRESSOR GENE. 
SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 
TISSUE SPECIFICITY : FOUND IN AXONS OF THE CENTRAL AND 
PERIPHERAL NERVOUS SYSTEM AND IN DIFFERENTIATED CELL 
TYPES OF THE INTESTINE. 
DISEASE : COLORECTAL TUMORS THAT LOST THEIR CAPACITY TO 
DIFFERENTIATE INTO MUCUS PRODUCING CELLS UNIFORMLY LACK 
DCC EXPRESSION. INACTIVATION OF DCC DUE TO ALLELIC 
DELETION AND/OR POINT MUTATIONS MAY CAUSE BOTH 
LYMPHATIC AND HEMATOGENOUS METASTASIS OF OESOPHAGEAL 
SQUAMOUS CELL CARCINOMAS. 
SIMILARITY : CONTAINS 4 IMMUNOGLOBULIN-LIKE C2-TYPE 
DOMAINS. 
SIMILARITY : CONTAINS 6 FIBRONECTIN TYPE III-LIKE DOMAINS. 

NV-13: 

DCC_HUMAN 
TUMOR SUPPRESSOR PROTEIN DCC 



NV-14: 



MGR4_HUMAN 

METABOTROPIC GLUTAMATE RECEPTOR 4 

FUNCTION : RECEPTOR FOR GLUTAMATE. THE ACTIVITY OF THIS 
RECEPTOR IS MEDIATED BY A G-PROTEIN THAT INHIBITS 
ADENYLATE CYCLASE ACTIVITY. 
SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN. 
TISSUE SPECIFICITY : STRONGLY EXPRESSED IN THE CEREBELLUM. 
EXPRESSED AT LOW LEVELS IN HIPPOCAMPUS, HYPOTHALAMUS 
AND THALAMUS. NO EXPRESSION DETECTED IN LIVER. 
SIMILARITY : BELONGS TO FAMILY 3 OF G-PROTEIN COUPLED 
RECEPTORS. STRONGEST TO MGLUR6. 



NV-15: 

MCR_HUMAN 
MINERALOCORTICOID RECEPTOR 

FUNCTION : RECEPTOR FOR BOTH MINERALOCORTICOIDS (MC) SUCH 
AS ALDOSTERONE AND GLUCOCORTICOIDS (GC) SUCH AS 
CORTICOSTERONE OR CORTISOL. THE EFFECT OF MC IS TO 
INCREASE ION AND WATER TRANSPORT AND THUS RAISE 
EXTRACELLULAR FLUID VOLUME AND BLOOD PRESSURE AND 
LOWER POTASSIUM LEVELS. 
SUBCELLULAR LOCATION : NUCLEAR. 
DOMAIN : COMPOSED OF THREE DOMAINS: A MODULATING 
N-TERMINAL DOMAIN, A DNA-BINDING DOMAIN AND A 
C-TERMINAL STEROID-BINDING DOMAIN. 
SIMILARITY : BELONGS TO THE NUCLEAR HORMONE RECEPTORS 
FAMILY. NR3 SUBFAMILY. 

NV-16: 

MCR_HUMAN 

MINERALOCORTICOID RECEPTOR 

NV-17: 

Example III: Variant nucleic acid sequence 

The nucleic acid sequences of the invention include nucleic acid 
sequences which encode variant product and fragments and analogs thereof. The 
nucleic acid sequences may alternatively be sequences complementary to the 
above coding sequence, or to a region of said coding sequence. The length of the 
complementary sequence is sufficient to avoid the expression of the coding 
sequence. The nucleic acid sequences may be in the form of RNA or in the form 
of DNA, and include messenger RNA, synthetic RNA and DNA, cDNA, and 
genomic DNA. The DNA may be double-stranded or single-stranded, and if 
single-stranded may be the coding strand or the non-coding (anti-sense, 
complementary) strand. The nucleic acid sequences may also include 
dNTPs, rNTPs as well as non-naturally occurring sequences. The sequence may 
also be a part of a hybrid between an amino acid sequence and a nucleic acid 
sequence. 
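
By way of non-limiting illustration only, the relationship between a coding strand and its complementary (anti-sense) strand may be sketched as follows. The sketch handles unmodified DNA bases only; the function name is hypothetical.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand of a DNA sequence, written 5' to 3'."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("only unmodified DNA bases A, C, G, T are supported")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return dna.translate(_DNA_COMPLEMENT)[::-1]
```

Applying the function twice returns the original strand, which is a convenient sanity check when designing complementary fragments.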
In a general embodiment, the nucleic acid sequence has at least 90% 
identity with any one of the sequences identified as SEQ ID NO: 1 to SEQ ID 
NO: 17, provided that this sequence is not completely identical with that of the 
original sequence. 
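
By way of non-limiting illustration only, the two conditions of this embodiment (at least 90% identity with a variant sequence, and non-identity with the original sequence) may be sketched as follows. The sketch assumes pre-aligned, ungapped sequences of equal length, which is a simplification; the function names are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of identical positions between two pre-aligned sequences."""
    a, b = a.upper(), b.upper()
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def meets_identity_criterion(candidate: str, variant: str, original: str,
                             threshold: float = 90.0) -> bool:
    """True if candidate is >= threshold % identical to the variant sequence
    and is not completely identical to the original sequence."""
    return (percent_identity(candidate, variant) >= threshold
            and candidate.upper() != original.upper())
```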

The nucleic acid sequences may include the coding sequence by itself. By 
another alternative the coding region may be in combination with additional 
coding sequences, such as those coding for fusion protein or signal peptides, in 
combination with non-coding sequences, such as introns and control elements, 
promoter and terminator elements or 5' and/or 3' untranslated regions, effective 
for expression of the coding sequence in a suitable host, and/or in a vector or host 
environment in which the variant nucleic acid sequence is introduced as a 
heterologous sequence. 

The nucleic acid sequences of the present invention may also have the 
product coding sequence fused in-frame to a marker sequence which allows for 
purification of the variant product. The marker sequence may be, for example, a 
hexahistidine tag to provide for purification of the mature polypeptide fused to 
the marker in the case of a bacterial host, or the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The 
HA tag corresponds to an epitope derived from the influenza hemagglutinin 
protein (Wilson, I., et al., Cell 37:767 (1984)). 
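
By way of non-limiting illustration only, the in-frame fusion of a coding sequence to a C-terminal hexahistidine marker may be sketched as follows. The choice of the CAC codon for histidine, of TAA as the terminating codon, and of a C-terminal rather than N-terminal tag are assumptions made for the sketch; the function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
HIS6_TAG = "CAC" * 6  # six histidine codons (assumed codon choice)


def fuse_c_terminal_his_tag(cds: str) -> str:
    """Append a hexahistidine tag in-frame to the 3' end of a coding sequence."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if cds[-3:] in STOP_CODONS:
        cds = cds[:-3]  # drop the native stop codon so translation continues into the tag
    return cds + HIS6_TAG + "TAA"
```

Because the input is required to be a whole number of codons and the tag is itself a whole number of codons, the reading frame is preserved across the junction.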

Also included in the scope of the invention are fragments as defined above, 
also referred to herein as oligonucleotides, typically having at least 20 bases, 
preferably 20-30 bases, corresponding to a region of the coding nucleic 
acid sequence. The fragments may be used as probes, primers, and when 
complementary also as antisense agents, and the like, according to known 
methods. 

As indicated above, the nucleic acid sequence may be substantially as 
depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 17 or fragments thereof or 
sequences having at least 90% identity to the above sequence as explained above. 
Alternatively, due to the degenerate nature of the genetic code, the sequence 
may be a sequence coding for any one of the amino acid sequences of SEQ ID 
NO: 18 to SEQ ID NO: 34, or fragments or analogs of said amino acid sequence. 
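
By way of non-limiting illustration only, the consequence of the degeneracy of the genetic code may be sketched by counting how many distinct nucleotide sequences encode a given peptide under the standard genetic code. The function and variable names are hypothetical.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide (one-letter code)."""
    peptide = peptide.upper()
    unknown = set(peptide) - set(CODONS_PER_RESIDUE)
    if unknown:
        raise ValueError(f"unknown residue(s): {sorted(unknown)}")
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)
```

Even a short peptide is encoded by many nucleotide sequences, which is why the sequence may be defined by the amino acid sequence it encodes rather than by a single nucleotide sequence.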

A. Preparation of nucleic acid sequences 

The nucleic acid sequences may be obtained by screening cDNA libraries 
using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid 
sequences which encode the variant products disclosed above. cDNA libraries 
prepared from a variety of tissues are commercially available and procedures for 
screening and isolating cDNA clones are well-known to those of skill in the art. 
Such techniques are described in, for example, Sambrook et al. (1989) Molecular 
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel FM et al. (1989) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, N.Y. 

The nucleic acid sequences may be extended to obtain upstream and 
downstream sequences such as promoters, regulatory elements, and 5' and 3' 
untranslated regions (UTRs). Extension of the available transcript sequence may 
be performed by numerous methods known to those of skill in the art, such as 
PCR or primer extension (Sambrook et al., supra), or by the RACE method 
using, for example, the Marathon RACE kit (Clontech, Cat. # K1802-1). 

Alternatively, the technique of "restriction-site" PCR (Gobinda et al., PCR 
Methods Applic. 2:318-22, (1993)), which uses universal primers to retrieve 
flanking sequence adjacent to a known locus, may be employed. First, genomic 
DNA is amplified in the presence of a primer to a linker sequence and a primer 
specific to the known region. The amplified sequences are subjected to a second 
round of PCR with the same linker primer and another specific primer internal to 
the first one. Products of each round of PCR are transcribed with an appropriate 
RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using divergent 
primers based on a known region (Triglia, T. et al., Nucleic Acids Res. 16:8186, 
(1988)). The primers may be designed using OLIGO(R) 4.06 Primer Analysis 
Software (1992; National Biosciences Inc, Plymouth, Minn.), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC content of 
50% or more, and to anneal to the target sequence at temperatures of about 68-72°C. 
The method uses several restriction enzymes to generate a suitable fragment in 
the known region of a gene. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 
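
By way of non-limiting illustration only, the length and GC-content criteria stated above for primer design may be sketched as a simple filter. The sketch does not estimate the annealing temperature, which depends on the model and reaction conditions used by the design program; the function names are hypothetical and do not reproduce the OLIGO software.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def primer_meets_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check the 22-30 nucleotide length and >= 50% GC criteria from the text."""
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc
```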

Capture PCR (Lagerstrom, M. et al., PCR Methods Applic. 1:111-19, 
(1991)) is a method for PCR amplification of DNA fragments adjacent to a 
known sequence in human and yeast artificial chromosome DNA. Capture PCR 
also requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into a flanking part of the DNA molecule 
before PCR. 

Another method which may be used to retrieve flanking sequences is that 
of Parker, J.D., et al. (Nucleic Acids Res., 19:3055-60, (1991)). Additionally, one 
can use PCR, nested primers and PromoterFinder™ libraries to "walk in" genomic 
DNA (PromoterFinder™; Clontech, Palo Alto, CA). This process avoids the need 
to screen libraries and is useful in finding intron/exon junctions. Preferred 
libraries for screening for full length cDNAs are ones that have been size-selected 
to include larger cDNAs. Also, random primed libraries are preferred in that they 
will contain more sequences which contain the 5' and upstream regions of genes. 

A randomly primed library may be particularly useful if an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries are useful for 
extension into the 5' nontranslated regulatory region. 

The nucleic acid sequences and oligonucleotides of the invention can also 
be prepared by solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized, then 
joined to form continuous sequences up to several hundred bases. 
B. Use of variant nucleic acid sequence for the production of 
variant products 

In accordance with the present invention, nucleic acid sequences specified 
above may be used as recombinant DNA molecules that direct the expression of 
variant products. 

As will be understood by those of skill in the art, it may be advantageous 
to produce variant product-encoding nucleotide sequences possessing codons 
other than those which appear in any one of SEQ ID NO: 1 to SEQ ID NO: 17, 
which are those which naturally occur in the human genome. Codons preferred 
by a particular prokaryotic or eukaryotic host (Murray, E. et al., Nuc. Acids Res., 
17:477-508, (1989)) can be selected, for example, to increase the rate of variant 
product expression or to produce recombinant RNA transcripts having desirable 
properties, such as a longer half-life, than transcripts produced from naturally 
occurring sequence. 
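
By way of non-limiting illustration only, the substitution of host-preferred codons without altering the encoded product may be sketched as follows. The preference table shown is a hypothetical placeholder and is not taken from any real codon-usage data; in practice it would be derived from usage tables for the chosen host. The function names are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical host preferences (amino acid -> preferred codon), for illustration only.
PREFERRED_CODONS = {"L": "CTG", "R": "CGT", "P": "CCG"}


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of 3."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict[str, str] = PREFERRED_CODONS) -> str:
    """Replace each codon by the host-preferred synonymous codon, where one is listed."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(preferred.get(CODON_TABLE[codon], codon) for codon in codons)
```

Comparing `translate(cds)` with `translate(recode(cds))` confirms that only the nucleotide sequence, and not the encoded amino acid sequence, has changed.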

The nucleic acid sequences of the present invention can be engineered in 
order to alter a variant product coding sequence for a variety of reasons, 
including but not limited to, alterations which modify the cloning, processing 
and/or expression of the product. For example, alterations may be introduced 
using techniques which are well known in the art, e.g., site-directed mutagenesis, 
to insert new restriction sites, to alter glycosylation patterns, to change codon 
preference, etc. 

The present invention also includes recombinant constructs comprising 
one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 
sequence of the invention has been inserted, in a forward or reverse orientation. 
In a preferred aspect of this embodiment, the construct further comprises 
regulatory sequences, including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those 
of skill in the art, and are commercially available. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are also 
described in Sambrook, et al., (supra). 

The present invention also relates to host cells which are genetically 
engineered with vectors of the invention, and the production of the product of the 
invention by recombinant techniques. Host cells are genetically engineered (i.e., 
transduced, transformed or transfected) with the vectors of this invention which 
may be, for example, a cloning vector or an expression vector. The vector may 
be, for example, in the form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying the 
expression of the variant nucleic acid sequence. The culture conditions, such as 
temperature, pH and the like, are those previously used with the host cell selected 
for expression, and will be apparent to those skilled in the art. 

The nucleic acid sequences of the present invention may be included in 
any one of a variety of expression vectors for expressing a product. Such vectors 
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast 
plasmids; vectors derived from combinations of plasmids and phage DNA, viral 
DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, 
any other vector may be used as long as it is replicable and viable in the host. 
The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures known in the art. Such procedures 
and related sub-cloning procedures are deemed to be within the scope of those 
skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate transcription control sequence (promoter) to direct mRNA synthesis. 
Examples of such promoters include: LTR or SV40 promoter, the E.coli lac or 
trp promoter, the phage lambda PL promoter, and other promoters known to 
control expression of genes in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site for translation 
initiation, and a transcription terminator. The vector may also include 
appropriate sequences for amplifying expression. In addition, the expression 
vectors preferably contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E.coli. 

The vector containing the appropriate DNA sequence as described above, 
as well as an appropriate promoter or control sequence, may be employed to 
transform an appropriate host to permit the host to express the protein. Examples 
of appropriate expression hosts include: bacterial cells, such as E.coli, 
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells 
such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK 
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those skilled in the art from 
the teachings herein. The invention is not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected 
depending upon the use intended for the variant product. For example, when 
large quantities of variant product are needed for the induction of antibodies, 
vectors which direct high level expression of fusion proteins that are readily 
purified may be desirable. Such vectors include, but are not limited to, 
multifunctional E.coli cloning and expression vectors such as Bluescript(R) 
(Stratagene), in which the variant polypeptide coding sequence may be ligated 
into the vector in-frame with sequences for the amino-terminal Met and the 
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced; 
pIN vectors (Van Heeke & Schuster J. Biol. Chem. 264:5503-5509, (1989)); pET 
vectors (Novagen, Madison WI); and the like. 

In the yeast Saccharomyces cerevisiae a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase and 
PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al., 
(Methods in Enzymology 153:516-544, (1987)). 

In cases where plant expression vectors are used, the expression of a 
sequence encoding variant product may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters of 
CaMV (Brisson et al., Nature 310:511-514, (1984)) may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu et al., 
EMBO J., 6:307-311, (1987)). Alternatively, plant promoters such as the small 
subunit of RUBISCO (Coruzzi et al., EMBO J. 3:1671-1680, (1984); Broglie et 
al., Science 224:838-843, (1984)); or heat shock promoters (Winter J. and 
Sinibaldi R.M., Results Probl. Cell Differ., 17:85-105, (1991)) may be used. 
These constructs can be introduced into plant cells by direct DNA transformation 
or pathogen-mediated transfection. For reviews of such techniques, see Hobbs S. 
or Murry L.E. (1992) in McGraw Hill Yearbook of Science and Technology, 
McGraw Hill, New York, N.Y., pp 191-196; or Weissbach and Weissbach (1988) 
Methods for Plant Molecular Biology, Academic Press, New York, N.Y., pp 
421-463. 

Variant product may also be expressed in an insect system. In one such 
system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia 
larvae. The variant product coding sequence may be cloned into a nonessential 
region of the virus, such as the polyhedrin gene, and placed under control of the 
polyhedrin promoter. Successful insertion of variant coding sequence will render 
the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses are then used to infect S. frugiperda cells or 
Trichoplusia larvae in which variant protein is expressed (Smith et al., J. Virol. 
46:584, (1983); Engelhard, E.K. et al., Proc. Natl. Acad. Sci. 91:3224-7, (1994)). 

In mammalian host cells, a number of viral-based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, a 
variant product coding sequence may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a nonessential E1 or E3 region of the viral genome 
will result in a viable virus capable of expressing variant protein in infected host 
cells (Logan and Shenk, Proc. Natl. Acad. Sci. 81:3655-59, (1984)). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 
used to increase expression in mammalian host cells. 

Specific initiation signals may also be required for efficient translation of
a variant product coding sequence. These signals include the ATG initiation
codon and adjacent sequences. In cases where variant product coding sequence,
its initiation codon and upstream sequences are inserted into the appropriate
expression vector, no additional translational control signals may be needed.
However, in cases where only coding sequence, or a portion thereof, is inserted,
exogenous translational control signals including the ATG initiation codon
must be provided. Furthermore, the initiation codon must be in the correct
reading frame to ensure translation of the entire insert. Exogenous
translational elements and initiation codons can be of various origins, both
natural and synthetic. The efficiency of expression may be enhanced by the
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et al.,
Results Probl. Cell Differ., 20:125-62, (1994); Bittner et al., Methods in
Enzymol. 153:516-544, (1987)).
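
The reading-frame requirement described above can be illustrated with a minimal Python sketch. The function name `check_insert_frame` and the rule set it applies are assumptions made for illustration only; the sketch assumes the insert is supplied as a DNA string whose first base is the first base of the intended ATG initiation codon and that the standard stop codons apply.

```python
# Standard stop codons (assumption: standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_insert_frame(insert):
    """Return a list of reading-frame problems found in a coding insert.

    An empty list means the insert starts with ATG, has a length that is a
    multiple of three, and carries no stop codon before its final codon.
    """
    seq = insert.upper()
    problems = []
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at position 1")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    # A stop codon before the last codon would truncate the product.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(
                f"premature stop codon {codon} at codon {index + 1}"
            )
    return problems
```

A call such as `check_insert_frame("ATGGCCTAA")` returns an empty list, whereas an insert that has lost its ATG or carries an internal stop codon is reported.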

In a further embodiment, the present invention relates to host cells
containing the above-described constructs. The host cell can be a higher
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation
(Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular
Biology). Cell-free translation systems can also be employed to produce
polypeptides using RNAs derived from the DNA constructs of the present
invention.

A host cell strain may be chosen for its ability to modulate the expression
of the inserted sequences or to process the expressed protein in the desired
fashion. Such modifications of the protein include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and
acylation. Post-translational processing which cleaves a "pre-pro" form of the
protein may also be important for correct insertion, folding and/or function.
Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc. have specific
cellular machinery and characteristic mechanisms for such post-translational
activities and may be chosen to ensure the correct modification and processing of
the introduced, foreign protein.

For long-term, high-yield production of recombinant proteins, stable
expression is preferred. For example, cell lines which stably express variant
product may be transformed using expression vectors which contain viral origins
of replication or endogenous expression elements and a selectable marker gene.
Following the introduction of the vector, cells may be allowed to grow for 1-2
days in enriched media before they are switched to selective media. The
purpose of the selectable marker is to confer resistance to selection, and its
presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clumps of stably transformed cells can be
proliferated using tissue culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell
lines. These include, but are not limited to, the herpes simplex virus thymidine
kinase (Wigler M., et al., Cell 11:223-32, (1977)) and adenine
phosphoribosyltransferase (Lowy I., et al., Cell 22:817-23, (1980)) genes which
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite,
antibiotic or herbicide resistance can be used as the basis for selection; for
example, dhfr, which confers resistance to methotrexate (Wigler M., et al., Proc.
Natl. Acad. Sci. 77:3567-70, (1980)); npt, which confers resistance to the
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et al., J. Mol. Biol.,
150:1-14, (1981)); and als or pat, which confer resistance to chlorsulfuron and
phosphinothricin acetyltransferase, respectively (Murry, supra). Additional
selectable genes have been described, for example, trpB, which allows cells to
utilize indole in place of tryptophan, or hisD, which allows cells to utilize histidinol
in place of histidine (Hartman S.C. and R.C. Mulligan, Proc. Natl. Acad. Sci.
85:8047-51, (1988)). The use of visible markers has gained popularity with such
markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and
luciferase and its substrates, luciferin and ATP, being widely used not only to
identify transformants, but also to quantify the amount of transient or stable
protein expression attributable to a specific vector system (Rhodes, C.A. et al.,
Methods Mol. Biol., 55:121-131, (1995)).

Host cells transformed with a nucleotide sequence encoding variant
product may be cultured under conditions suitable for the expression and
recovery of the encoded protein from cell culture. The product produced by a
recombinant cell may be secreted or contained intracellularly depending on the
sequence and/or the vector used. As will be understood by those of skill in the
art, expression vectors containing nucleic acid sequences encoding variant
product can be designed with signal sequences which direct secretion of variant
product through a prokaryotic or eukaryotic cell membrane.

The variant product may also be expressed as a recombinant protein with
one or more additional polypeptide domains added to facilitate protein
purification. Such purification facilitating domains include, but are not limited
to, metal chelating peptides such as histidine-tryptophan modules that allow
purification on immobilized metals, protein A domains that allow purification on
immobilized immunoglobulin, and the domain utilized in the FLAGS
extension/affinity purification system (Immunex Corp, Seattle, Wash.). The
inclusion of a protease-cleavable polypeptide linker sequence between the
purification domain and variant product is useful to facilitate purification. One
such expression vector provides for expression of a fusion protein comprising
a variant polypeptide fused to a polyhistidine region separated by an enterokinase
cleavage site. The histidine residues facilitate purification on IMAC
(immobilized metal ion affinity chromatography, as described in Porath, et al.,
Protein Expression and Purification, 3:263-281, (1992)) while the enterokinase
cleavage site provides a means for isolating variant polypeptide from the fusion
protein. pGEX vectors (Promega, Madison, Wis.) may also be used to express
foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In
general, such fusion proteins are soluble and can easily be purified from lysed
cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case
of GST-fusions) followed by elution in the presence of free ligand.

Following transformation of a suitable host strain and growth of the host
strain to an appropriate cell density, the selected promoter is induced by
appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period. Cells are typically harvested by centrifugation,
disrupted by physical or chemical means, and the resulting crude extract retained
for further purification. Microbial cells employed in expression of proteins can
be disrupted by any convenient method, including freeze-thaw cycling,
sonication, mechanical disruption, or use of cell lysing agents, or other methods,
which are well known to those skilled in the art.

The variant products can be recovered and purified from recombinant cell
cultures by any of a number of methods well known in the art, including
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation
exchange chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite
chromatography, and lectin chromatography. Protein refolding steps can be used,
as necessary, in completing configuration of the mature protein. Finally, high
performance liquid chromatography (HPLC) can be employed for final
purification steps.

C. Diagnostic applications utilizing nucleic acid sequences

The nucleic acid sequences of the present invention may be used for a
variety of diagnostic purposes. The nucleic acid sequences may be used to detect
and quantitate expression of the variant in a patient's cells, e.g. biopsied tissues, by
detecting the presence of mRNA coding for variant product. Alternatively, the
assay may be used to detect soluble variant in the serum or blood. This assay
typically involves obtaining total mRNA from the tissue or serum and contacting
the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically
hybridizing with a sequence included within the sequence of a nucleic acid
molecule encoding variant product under hybridizing conditions. The
presence of mRNA hybridized to the probe is then detected, thereby detecting
the expression of variant. This assay can be used to distinguish between absence,
presence, and excess expression of variant product and to monitor levels of
variant expression during therapeutic intervention. In addition, the assay may be
used to compare the levels of the variant of the invention to the levels of the
original sequence from which it has been varied or to levels of other variants,
which comparison may have some physiological meaning.
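
As a rough illustration of the probe described above, the following Python sketch derives a DNA probe complementary to a chosen window of a target mRNA. The function name `antisense_probe`, the window parameters and the example sequence are hypothetical; the sketch only enforces the 20-30 nucleotide length preference stated in the text and does not model hybridization conditions or specificity.

```python
# Map each mRNA base to the DNA base that pairs with it.
_RNA_TO_DNA_COMPLEMENT = str.maketrans("ACGU", "TGCA")


def antisense_probe(mrna, start, length=25):
    """Return a DNA probe (5'->3') complementary to mrna[start:start+length]."""
    if not 20 <= length <= 30:
        raise ValueError("probe length should be 20-30 nucleotides")
    target = mrna.upper()[start:start + length]
    if len(target) < length:
        raise ValueError("window extends past the end of the mRNA")
    # Complement each base, then reverse so the probe reads 5'->3'.
    return target.translate(_RNA_TO_DNA_COMPLEMENT)[::-1]


# Hypothetical 27-nucleotide mRNA fragment used only as an example.
example_mrna = "AUGGCCAAGUUCGGAUCCAAGCUUGGG"
probe = antisense_probe(example_mrna, start=0, length=20)
```

Here `probe` is the reverse complement of the first 20 nucleotides of the example fragment, written as DNA.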

The invention also contemplates the use of the nucleic acid sequences as a
diagnostic for diseases resulting from inherited defective variant sequences, or
diseases in which the ratio of the amount of the original sequence from which the
variant was varied to the novel variants of the invention is altered. These
sequences can be detected by comparing the sequences of the defective (i.e.,
mutant) variant coding region with that of a normal coding region. Association
of the sequence coding for mutant variant product with abnormal variant product
activity may be verified. In addition, sequences encoding mutant variant products
can be inserted into a suitable vector for expression in a functional assay system
(e.g., colorimetric assay, complementation experiments in a variant protein
deficient strain of HEK293 cells) as yet another means to verify or identify
mutations. Once mutant genes have been identified, one can then screen
populations of interest for carriers of the mutant gene.

Individuals carrying mutations in the nucleic acid sequence of the present
invention may be detected at the DNA level by a variety of techniques. Nucleic
acids used for diagnosis may be obtained from a patient's cells, including but not
limited to cells from blood, urine, saliva, placenta, tissue biopsy and autopsy
material. Genomic DNA may be used directly for detection or may be amplified
enzymatically by using PCR (Saiki, et al., Nature 324:163-166, (1986)) prior to
analysis. RNA or cDNA may also be used for the same purpose. As an example,
PCR primers complementary to the nucleic acid of the present invention can be
used to identify and analyze mutations in the gene of the present invention.
Deletions and insertions can be detected by a change in size of the amplified
product in comparison to the normal genotype.
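
The size comparison described above can be sketched in a few lines of Python. The function name `classify_amplicon` and the `tolerance_bp` parameter (an allowance for sizing error on a gel or capillary instrument) are assumptions introduced for illustration and do not appear in the text.

```python
def classify_amplicon(observed_bp, reference_bp, tolerance_bp=0):
    """Compare an amplified product with the normal-genotype product size.

    Returns a (kind, size) tuple where kind is "insertion", "deletion" or
    "none" and size is the absolute difference in base pairs.
    """
    delta = observed_bp - reference_bp
    if abs(delta) <= tolerance_bp:
        # Within sizing error: treated as the normal genotype.
        return ("none", 0)
    kind = "insertion" if delta > 0 else "deletion"
    return (kind, abs(delta))
```

A product of 512 bp against a 500 bp reference would be reported as an insertion of 12 bp; point mutations leave the size unchanged and need the hybridization-based methods described next.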

Point mutations can be identified by hybridizing amplified DNA to
radiolabeled RNA of the invention or, alternatively, radiolabeled antisense DNA
sequences of the invention. Sequence changes at specific locations may also be
revealed by nuclease protection assays, such as RNase and S1 protection, or the
chemical cleavage method (e.g. Cotton, et al., Proc. Natl. Acad. Sci. USA,
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular
beacons" (Kostrikis L.G. et al., Science 279:1228-1229, (1998)), hairpin-shaped,
single-stranded synthetic oligonucleotides containing probe sequences which
are complementary to the nucleic acid of the present invention, may also be used
to detect point mutations or other sequence changes as well as monitor expression
levels of variant product. Such diagnostics would be particularly useful for
prenatal testing.

Another method for detecting mutations uses two DNA probes which are
designed to hybridize to adjacent regions of a target, with abutting bases, where
the region of known or suspected mutation(s) is at or near the abutting bases.
The two probes may be joined at the abutting bases, e.g., in the presence of a
ligase enzyme, but only if both probes are correctly base paired in the region of
probe junction. The presence or absence of mutations is then detectable by the
presence or absence of ligated probe.
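
The logic of the two-probe ligation method can be sketched as follows. This is a simplified model under stated assumptions, not a description of any particular ligase: the function names, the `junction` index and the `window` parameter (how many bases on each side of the junction must be perfectly paired) are all hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def probes_ligate(target, junction, probe_a, probe_b, window=3):
    """Model whether two abutting probes would be joined on a target.

    probe_a anneals to target[junction - len(probe_a):junction] and
    probe_b anneals to target[junction:junction + len(probe_b)].
    Ligation is modeled as requiring perfect pairing within `window`
    bases on each side of the junction.
    """
    target = target.upper()
    if junction - len(probe_a) < 0 or junction + len(probe_b) > len(target):
        raise ValueError("probe footprint runs off the target")
    site_a = target[junction - len(probe_a):junction]
    site_b = target[junction:junction + len(probe_b)]
    # revcomp(probe) is the target sequence the probe was designed for.
    paired_a = revcomp(probe_a)[-window:] == site_a[-window:]
    paired_b = revcomp(probe_b)[:window] == site_b[:window]
    return paired_a and paired_b
```

With probes designed against the normal sequence, a target carrying a substitution at the junction returns `False` (no ligated product), which is the signal the method relies on.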

Also suitable for detecting mutations in the variant product coding
sequence are oligonucleotide array methods based on sequencing by
hybridization (SBH), as described, for example, in U.S. Patent No. 5,547,839. In
a typical method, the DNA target analyte is hybridized with an array of
oligonucleotides formed on a microchip. The sequence of the target can then be
"read" from the pattern of target binding to the array.

D. Gene mapping utilizing nucleic acid sequences 

The nucleic acid sequences of the present invention are also valuable for
chromosome identification. The sequence is specifically targeted to and can
hybridize with a particular location on an individual human chromosome.
Moreover, there is a current need for identifying particular sites on the
chromosome. Few chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking chromosomal
location. The mapping of DNAs to chromosomes according to the present
invention is an important first step in correlating those sequences with genes
associated with disease.

Briefly, sequences can be mapped to chromosomes by preparing PCR
primers (preferably 20-30 bp) from the variant cDNA. Computer analysis of the
3' untranslated region is used to rapidly select primers that do not span more than
one exon in the genomic DNA, which would complicate the amplification
process. These primers are then used for PCR screening of somatic cell hybrids
containing individual human chromosomes. Only those hybrids containing the
human gene corresponding to the primer will yield an amplified fragment.
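
The primer-selection criterion above (a primer should not span more than one exon) can be expressed as a simple interval check. The sketch below assumes that exon boundaries are already known as half-open `(start, end)` intervals in cDNA coordinates; the function name and the example coordinates are hypothetical.

```python
def within_single_exon(primer_start, primer_len, exons):
    """Return True if the primer lies entirely inside one exon.

    exons is a list of half-open (start, end) intervals in cDNA coordinates.
    """
    primer_end = primer_start + primer_len
    return any(
        start <= primer_start and primer_end <= end
        for start, end in exons
    )


# Hypothetical exon map: two exons joined at cDNA position 120.
example_exons = [(0, 120), (120, 300)]
```

A 25-base primer starting at position 90 passes the check, while one starting at position 110 crosses the exon junction and would be rejected.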

PCR mapping of somatic cell hybrids, or alternatively of radiation hybrids,
is a rapid procedure for assigning a particular DNA to a particular chromosome.
Using the present invention with the same oligonucleotide primers,
sublocalization can be achieved with panels of fragments from specific
chromosomes or pools of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a sequence to its chromosome
include in situ hybridization, prescreening with labeled flow-sorted chromosomes and
preselection by hybridization to construct chromosome-specific cDNA libraries.



Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase
chromosomal spread can be used to provide a precise chromosomal location in
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a
review of this technique, see Verma et al., Human Chromosomes: a Manual of
Basic Techniques, (1988) Pergamon Press, New York.

Once a sequence has been mapped to a precise chromosomal location, the
physical position of the sequence on the chromosome can be correlated with
genetic map data. Such data are found, for example, in the OMIM database
(Center for Medical Genetics, Johns Hopkins University, Baltimore, MD and
National Center for Biotechnology Information, National Library of Medicine,
Bethesda, MD). The OMIM gene map presents the cytogenetic map location of
disease genes and other expressed genes. The OMIM database provides
information on diseases associated with the chromosomal location. Such
associations include the results of linkage analysis mapped to this interval, and
the correlation of translocations and other chromosomal aberrations in this area
with the advent of polygenic diseases, such as cancer in general and prostate
cancer in particular.

E. Therapeutic applications of nucleic acid sequences 

Nucleic acid sequences of the invention may also be used for therapeutic
purposes. Turning first to the second aspect of the invention (i.e. inhibition of
expression of variant), expression of variant product may be modulated through
antisense technology, which controls gene expression through hybridization of
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the
control, 5' or regulatory regions of the gene encoding variant product. For
example, the 5' coding portion of the nucleic acid sequence which
codes for the product of the present invention is used to design an antisense
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides
derived from the transcription start site, e.g. between positions -10 and +10 from
the start site, are preferred. An antisense DNA oligonucleotide is designed to be
complementary to a region of the nucleic acid sequence involved in transcription
(Lee et al., Nucl. Acids Res., 6:3073, (1979); Cooney et al., Science 241:456,
(1988); and Dervan et al., Science 251:1360, (1991)), thereby preventing
transcription and the production of the variant products. An antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the
mRNA molecule into the variant products (Okano, J. Neurochem. 56:560,
(1991)). The antisense constructs can be delivered to cells by procedures known
in the art such that the antisense RNA or DNA may be expressed in vivo. The
antisense may be an antisense mRNA or a DNA sequence capable of coding such
antisense mRNA. The antisense mRNA or the DNA coding therefor can be
complementary to the full sequence of nucleic acid sequences coding for the
variant protein or to a fragment of such a sequence which is sufficient to inhibit
production of a protein product.

Turning now to the first aspect of the invention, i.e. expression of variant,
expression of variant product may be increased by providing coding sequences
for said product under the control of suitable control elements enabling
its expression in the desired host.

The nucleic acid sequences of the invention may be employed in
combination with a suitable pharmaceutical carrier. Such compositions comprise
a therapeutically effective amount of the compound, and a pharmaceutically
acceptable carrier or excipient. Such a carrier includes but is not limited to saline,
buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The
formulation should suit the mode of administration.

The products of the invention, as well as any activator and deactivator
compounds (see below) which are polypeptides, may also be employed in
accordance with the present invention by expression of such polypeptides in vivo,
which is often referred to as "gene therapy." Cells from a patient may be
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide
ex vivo, with the engineered cells then being provided to a patient to be treated
with the polypeptide. Such methods are well-known in the art. For example, cells
may be engineered by procedures known in the art by use of a retroviral particle
containing RNA encoding a polypeptide of the present invention.

Similarly, cells may be engineered in vivo for expression of a polypeptide
in vivo by procedures known in the art. As known in the art, a producer cell for
producing a retroviral particle containing RNA encoding the polypeptide of the
present invention may be administered to a patient for engineering cells in vivo
and expression of the polypeptide in vivo. These and other methods for
administering a product of the present invention by such method should be
apparent to those skilled in the art from the teachings of the present invention.
For example, the expression vehicle for engineering cells may be other than a
retrovirus, for example, an adenovirus which may be used to engineer cells in
vivo after combination with a suitable delivery vehicle.

Retroviruses from which the retroviral plasmid vectors mentioned above
may be derived include, but are not limited to, Moloney Murine Leukemia Virus,
spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency
virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor virus.

The retroviral plasmid vector is employed to transduce packaging cell
lines to form producer cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501, PA317, psi-2, psi-AM,
PA12, T19-14X, VT-19-17-H2, psi-CRE, psi-CRIP, GP+E-86, GP+envAm12,
and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1,
pg. 5-14, (1990)). The vector may transduce the packaging cells through any
means known in the art. Such means include, but are not limited to,
electroporation, the use of liposomes, and CaPO4 precipitation. In one alternative,
the retroviral plasmid vector may be encapsulated into a liposome, or coupled to a
lipid, and then administered to a host.

The producer cell line generates infectious retroviral vector particles
which include the nucleic acid sequence(s) encoding the polypeptides. Such
retroviral vector particles then may be employed to transduce eukaryotic cells,
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic
acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem cells, embryonic
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts,
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells.

The genes introduced into cells may be placed under the control of
inducible promoters, such as the radiation-inducible Egr-1 promoter (Maceri,
H.J., et al., Cancer Res., 56(19):4311 (1996)), to stimulate variant production or
antisense inhibition in response to radiation, e.g., radiation therapy for treating
tumors.

Example IV. Variant product 

The substantially purified variant product of the invention has been
defined above as the product coded from the nucleic acid sequence of the
invention. Preferably the amino acid sequence is an amino acid sequence having
at least 90% identity to any one of the sequences identified as SEQ ID NO: 18 to
SEQ ID NO: 34 provided that the amino acid sequence is not identical to that of
the original sequence from which it has been varied. The protein or polypeptide
may be in mature and/or modified form, also as defined above. Also
contemplated are protein fragments having at least 10 contiguous amino acid
residues, preferably at least 10-20 residues, derived from the variant product, as
well as homologues as explained above.
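
The 90% identity threshold and the 10-residue fragment criterion can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned to equal length (gaps written as "-") and uses the alignment length as the denominator, which is only one of several conventions; the text's own definition of identity, given earlier in the specification, governs. The function names and example peptides are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the full length of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    # A gap never counts as a match, even opposite another gap.
    matches = sum(
        a == b and a != "-" for a, b in zip(aligned_a, aligned_b)
    )
    return 100.0 * matches / len(aligned_a)


def contiguous_fragments(sequence, length=10):
    """Yield every contiguous fragment of exactly `length` residues."""
    for start in range(len(sequence) - length + 1):
        yield sequence[start:start + length]


def meets_identity_criterion(candidate, listed, original, threshold=90.0):
    """At least `threshold` percent identical to the listed sequence,
    and not identical to the original sequence it was varied from."""
    return (
        percent_identity(candidate, listed) >= threshold
        and candidate != original
    )
```

For two aligned 10-residue peptides differing at one position, `percent_identity` gives 90.0, so the candidate meets the threshold provided it differs from the original sequence.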

The sequence variations are preferably those that are considered conserved
substitutions, as defined above. Thus, for example, a protein with a sequence
having at least 90% sequence identity with any of the products identified as SEQ
ID NO: 18 to SEQ ID NO: 34, preferably by utilizing conserved substitutions as
defined above, is also part of the invention, provided that it is not identical to
the original peptide from which it has been varied. In a more specific
embodiment, the protein has or contains any one of the sequences identified as
SEQ ID NO: 18 to SEQ ID NO: 34. The variant product may be (i) one in which
one or more of the amino acid residues in a sequence listed above are substituted
with a conserved or non-conserved amino acid residue (preferably a conserved
amino acid residue), or (ii) one in which one or more of the amino acid residues
includes a substituent group, or (iii) one in which the variant product is fused
with another compound, such as a compound to increase the half-life of the
protein (for example, polyethylene glycol (PEG)), or a moiety which serves as
targeting means to direct the protein to its target tissue or target cell population
(such as an antibody), or (iv) one in which additional amino acids are fused to the
variant product. Such fragments, variants and derivatives are deemed to be within
the scope of those skilled in the art from the teachings herein.

A. Preparation of variant product 

Recombinant methods for producing and isolating the variant product, and 
fragments of the protein are described above. 

In addition to recombinant production, fragments and portions of variant
product may be produced by direct peptide synthesis using solid-phase techniques
(cf. Stewart et al., (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San
Francisco; Merrifield J., J. Am. Chem. Soc., 85:2149-2154, (1963)). In vitro
peptide synthesis may be performed using manual techniques or by automation.
Automated synthesis may be achieved, for example, using Applied Biosystems
431A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with
the instructions provided by the manufacturer. Fragments of variant product may
be chemically synthesized separately and combined using chemical methods to
produce the full length molecule.

B. Therapeutic uses and compositions utilizing the variant product 

The variant product of the invention is generally useful in treating diseases
and disorders which are characterized by a lower than normal level of variant
expression, and/or diseases which can be cured or ameliorated by raising the level
of the variant product, even if the level is normal.

Variant products or fragments may be administered by any of a number of
routes and methods designed to provide a consistent and predictable
concentration of compound at the target organ or tissue. The product-containing
compositions may be administered alone or in combination with other agents,
such as stabilizing compounds, and/or in combination with other pharmaceutical
agents such as drugs or hormones.

Variant product-containing compositions may be administered by a
number of routes including, but not limited to, oral, intravenous, intramuscular,
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal
application. Variant product-containing compositions may also be administered
via liposomes. Such administration routes and appropriate formulations are
generally known to those of skill in the art.

The product can be given via intravenous or intraperitoneal injection.
Similarly, the product may be injected to other localized regions of the body. The
product may also be administered via nasal insufflation. Enteral administration is
also possible. For such administration, the product should be formulated into an
appropriate capsule or elixir for oral administration, or into a suppository for
rectal administration.

The foregoing exemplary administration modes will likely require that the
product be formulated into an appropriate carrier, including ointments, gels and
suppositories. Appropriate formulations are well known to persons skilled in the
art.

Dosage of the product will vary, depending upon the potency and 
therapeutic index of the particular polypeptide selected. 

A therapeutic composition for use in the treatment method can include the
product in a sterile injectable solution, the polypeptide in an oral delivery vehicle,
the product in an aerosol suitable for nasal administration, or the product in a
nebulized form, all prepared according to well known methods. Such
compositions comprise a therapeutically effective amount of the compound, and a
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not
limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and
combinations thereof. The product of the invention may also be used to modulate
endothelial differentiation and proliferation as well as to modulate apoptosis
either ex vivo or in vitro, for example, in cell cultures.

Example V. Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules,
such as synthetic drugs, antibodies, peptides, or other molecules, which have a
modulating effect on the activity of the variant product, e.g. activators or
deactivators of the variant product of the present invention. Such an assay
comprises the steps of providing a variant product encoded by the nucleic acid
sequences of the present invention, contacting the variant protein with one or
more candidate molecules to determine the candidate molecules' modulating
effect on the activity of the variant product, and selecting from the molecules a
candidate molecule capable of modulating variant product physiological
activity.

The variant product, its catalytic or immunogenic fragments or
oligopeptides thereof, can be used for screening therapeutic compounds in any of
a variety of drug screening techniques. The fragment employed in such a test
may be free in solution, affixed to a solid support, borne on a cell membrane or
located intracellularly. The formation of binding complexes, between variant
product and the agent being tested, may be measured. Alternatively, the activator
or deactivator may work by serving as agonist or antagonist, respectively, of the
variant receptor, binding entity or target site, and their effect may be determined
in connection with any of the above.

Another technique for drug screening, which provides for
high throughput screening of compounds having suitable binding affinity to the
variant product, is described in detail by Geysen in PCT Application WO
84/03564, published on Sep. 13, 1984. In summary, large numbers of different
small peptide test compounds are synthesized on a solid substrate, such as plastic
pins or some other surface. The peptide test compounds are reacted with the full
variant product or with fragments of variant product and washed. Bound variant
product is then detected by methods well known in the art. Substantially purified
variant product can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing
antibodies can be used to capture the peptide and immobilize it on a solid
support.

Antibodies to the variant product, as described in Example VI below, may
also be used in screening assays according to methods well known in the art. For
example, a "sandwich" assay may be performed, in which an anti-variant
antibody is affixed to a solid surface such as a microtiter plate and variant
product is added. Such an assay can be used to capture compounds which bind to
the variant product. Alternatively, such an assay may be used to measure the
ability of compounds to interfere with the binding of variant product to the
variant receptor, and then select those compounds which affect the binding.

Example VI. Anti-variant antibodies 
A. Synthesis 

In still another aspect of the invention, the purified variant product is used 
to produce anti-variant antibodies which have diagnostic and therapeutic uses 
related to the activity, distribution, and expression of the variant product. 

Antibodies to the variant product may be generated by methods well 
known in the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, humanized, single chain, Fab fragments and fragments 
produced by an Fab expression library. Antibodies which inhibit dimer 
formation are especially preferred for therapeutic use. 

A fragment of the variant product used for antibody induction need not 
have biological activity, but it must have immunological activity; that is, the 
protein fragment or oligopeptide must be antigenic. Peptides used to induce 
specific antibodies may have an amino acid sequence consisting of at least five 
amino acids, preferably at least 10 amino acids, of the sequences specified in 
any one of SEQ ID NO: 18 to SEQ ID NO: 34. Preferably they should mimic a 
portion of the amino acid sequence of the natural protein and may contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short 
stretches of variant protein amino acids may be fused with those of another 
protein such as keyhole limpet hemocyanin, and antibodies produced against the 
chimeric molecule. Procedures well known in the art can be used for the 
production of antibodies to variant product. 
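
As an illustration of the length criterion above, the following minimal sketch 
enumerates every contiguous peptide of a chosen length (at least five residues, 
by default 10) from a protein sequence. The function name and the example 
sequence are hypothetical placeholders and are not taken from SEQ ID NO: 18 to 
SEQ ID NO: 34; antigenicity itself is not predicted here. 

```python
def candidate_peptides(protein: str, length: int = 10) -> list[str]:
    """Return every contiguous window of `length` residues from `protein`.

    The lower bound of five residues follows the text; the default of 10
    reflects the stated preference. No antigenicity prediction is attempted.
    """
    if length < 5:
        raise ValueError("peptides should contain at least five amino acids")
    protein = protein.upper()
    return [protein[i:i + length] for i in range(len(protein) - length + 1)]


if __name__ == "__main__":
    # Hypothetical placeholder sequence, used only to show the call.
    for peptide in candidate_peptides("ACDEFGHIKLMN", length=10):
        print(peptide)
```

Each returned window would still have to be checked experimentally for its 
ability to induce specific antibodies. 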

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, etc., may be immunized by injection with variant product or any 
portion, fragment or oligopeptide which retains immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to Freund's, 
mineral gels such as aluminum hydroxide, and surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. BCG (bacillus Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants. 

Monoclonal antibodies to variant protein may be prepared using any 
technique which provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstein (Nature 256:495-497 (1975)), the 
human B-cell hybridoma technique (Kosbor et al., Immunol. Today 4:72 (1983); 
Cote et al., Proc. Natl. Acad. Sci. 80:2026-2030 (1983)) and the EBV-hybridoma 
technique (Cole et al., Mol. Cell Biol. 62:109-120 (1984)). 

Techniques developed for the production of "chimeric antibodies", i.e., the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can also be used 
(Morrison et al., Proc. Natl. Acad. Sci. 81:6851-6855 (1984); Neuberger et al., 
Nature 312:604-608 (1984); Takeda et al., Nature 314:452-454 (1985)). 
Alternatively, techniques described for the production of single chain antibodies 
(U.S. Pat. No. 4,946,778) can be adapted to produce single-chain antibodies 
specific for the variant protein. 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 
panels of highly specific binding reagents as disclosed in Orlandi et al. (Proc. 
Natl. Acad. Sci. 86:3833-3837 (1989)), and Winter G. and Milstein C. (Nature 
349:293-299 (1991)). 

Antibody fragments which contain specific binding sites for variant 
protein may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin digestion of 
the antibody molecule and the Fab fragments which can be generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity (Huse W.D. et al., Science 
246:1275-1281 (1989)). 

B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between the variant product and its specific antibody and 
the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering 
epitopes on a specific variant product is preferred, but a competitive binding 
assay may also be employed. These assays are described in Maddox D.E. et al. 
(J. Exp. Med. 158:1211 (1983)). 

Antibodies which specifically bind variant product are useful for the 
diagnosis of conditions or diseases characterized by expression of the novel 
variant of the invention (where normally it is not expressed) or by over- or 
under-expression of the variant, as well as for detection of diseases in which the 
proportion between the amount of the variants of the invention and the original 
sequence from which they varied is altered. Alternatively, such antibodies may be 
used in assays to monitor patients being treated with variant product, its 
activators, or its deactivators. Diagnostic assays for variant protein include 
methods utilizing the antibody and a label to detect variant product in human 
body fluids or extracts of cells or tissues. The products and antibodies of the 
present invention may be used with or without modification. Frequently, the 
proteins and antibodies will be labeled by joining them, either covalently or 
noncovalently, with a reporter molecule. A wide variety of reporter molecules 
are known in the art. 
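
The proportion between variant and original product mentioned above can be 
sketched numerically as follows. This is only an illustration: the function 
names, the reference ratio and the two-fold cutoff are assumptions introduced 
here and are not specified in the text. 

```python
def level_ratio(variant_level: float, original_level: float) -> float:
    """Ratio of the variant level to the level of the original sequence."""
    if variant_level < 0 or original_level <= 0:
        raise ValueError("levels must be non-negative and the original level positive")
    return variant_level / original_level


def is_altered(ratio: float, reference_ratio: float, fold: float = 2.0) -> bool:
    """Flag a ratio that differs from the reference by at least `fold`.

    The fold cutoff is a hypothetical choice made for this sketch.
    """
    if reference_ratio <= 0 or fold <= 1:
        raise ValueError("reference_ratio must be positive and fold greater than 1")
    return ratio >= reference_ratio * fold or ratio <= reference_ratio / fold


if __name__ == "__main__":
    r = level_ratio(variant_level=30.0, original_level=10.0)
    print(r, is_altered(r, reference_ratio=1.0))
```

Both levels are assumed to have been measured in the same units and normalized 
in the same way before the ratio is taken. 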

A variety of protocols for measuring the variant product, using either 
polyclonal or monoclonal antibodies specific for the respective protein, are known 
in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescence-activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on variant product is 
preferred, but a competitive binding assay may be employed. These assays are 
described, among other places, in Maddox et al. (supra). Such protocols provide 
a basis for diagnosing altered or abnormal levels of variant product expression. 
Normal or standard values for variant product expression are established by 
combining body fluids or cell extracts taken from normal subjects, preferably 
human, with antibody to variant product under conditions suitable for complex 
formation which are well known in the art. The amount of standard complex 
formation may be quantified by various methods, preferably by photometric 
methods. Then, standard values obtained from normal samples may be compared 
with values obtained from samples from subjects potentially affected by disease. 
Deviation between standard and subject values establishes the presence of a 
disease state. 
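
The comparison of subject values with standard values described above can be 
sketched as follows. The text does not say how large a deviation must be, so 
the rule used here (more than k sample standard deviations from the mean of the 
normal samples, with k = 2) is an assumption made purely for illustration, as 
are the function names and the example readings. 

```python
from statistics import mean, stdev


def standard_range(normal_values: list[float], k: float = 2.0) -> tuple[float, float]:
    """Return (low, high) bounds: mean of the normal samples +/- k sample SDs."""
    if len(normal_values) < 2:
        raise ValueError("at least two normal samples are needed")
    centre = mean(normal_values)
    spread = stdev(normal_values)
    return centre - k * spread, centre + k * spread


def deviates(subject_value: float, normal_values: list[float], k: float = 2.0) -> bool:
    """True if the subject value falls outside the standard range."""
    low, high = standard_range(normal_values, k)
    return subject_value < low or subject_value > high


if __name__ == "__main__":
    # Hypothetical photometric readings from normal subjects.
    normal = [1.0, 1.1, 0.9, 1.0, 1.05, 0.95]
    print(standard_range(normal))
    print(deviates(1.5, normal), deviates(1.02, normal))
```

In practice the cutoff would be set from validated reference data for the 
particular assay rather than from a fixed multiple of the standard deviation. 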

The antibody assays are useful to determine the level of variant product 
present in a body fluid sample, in order to determine whether it is being 
expressed at all, whether it is being overexpressed or underexpressed in the 
tissue, or as an indication of how levels of variant product are 
responding to drug treatment. 

By another aspect the invention concerns methods for determining the 
presence or level of various anti-variant antibodies in a biological sample 
obtained from patients, such as a blood or serum sample, using the variant 
product as an antigen. Determination of said antibodies may be indicative of a 
plurality of pathological conditions or diseases. 

C. Therapeutic uses of antibodies 

In addition to their diagnostic use, the antibodies may have therapeutic 
utility in blocking or decreasing the activity of the variant product in pathological 
conditions where a beneficial effect can be achieved by such a decrease. 

The antibody employed is preferably a humanized monoclonal antibody, 
or a human MAb produced by known globulin-gene library methods. The 
antibody is administered typically as a sterile solution by IV injection, although 
other parenteral routes may be suitable. Typically, the antibody is administered 
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 
continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is 
seen. 
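
The arithmetic behind the stated range of about 1-15 mg/kg can be sketched as 
below. The function name and the 70 kg example weight are assumptions used only 
to show the calculation; the sketch does not select a dose within the range. 

```python
def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float = 1.0,
                  high_mg_per_kg: float = 15.0) -> tuple[float, float]:
    """Return the (lowest, highest) total dose in mg for a given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg


if __name__ == "__main__":
    # Hypothetical 70 kg subject.
    print(dose_range_mg(70.0))
```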

Although the invention has been described with reference to specific 
methods and embodiments, it is appreciated that various modifications and 
changes may be made without departing from the invention. 

CLAIMS: 

1. An isolated nucleic acid sequence, of an alternative splicing variant, selected 
from the group consisting of: 

(i) the nucleic acid sequence depicted in any one of SEQ ID NO: 1 to 
SEQ ID NO: 17; 

(ii) nucleic acid sequences having at least 90% identity with the 
sequence of (i) with the proviso that each sequence is different from the original 
nucleic acid sequence from which the sequences of (i) have been varied by 
alternative splicing; and 

(iii) fragments of (i) or (ii) of at least 20 b.p., provided that said fragment 
contains a sequence which is not present, as a continuous stretch of nucleotides, in 
the original nucleic acid sequence from which the sequences of (i) have been 
varied by alternative splicing. 

2. An isolated nucleic acid sequence complementary to the nucleic acid 
sequence of Claim 1. 

3. An amino acid sequence selected from the group consisting of: 

(i) an amino acid sequence coded by the isolated nucleic acid sequence 
of alternative splice variants of Claim 1; 

(ii) homologues of the amino acid sequences of (i) in which one or more 
amino acids has been added, deleted, replaced or chemically 
modified in the region or adjacent to the region where the amino 
acid sequence differs from the original amino acid sequence, coded 
by the original nucleic acid sequence from which the variant has been 
varied. 

4. An amino acid sequence according to Claim 3, as depicted in any one of 
SEQ ID NO: 18 to SEQ ID NO: 34. 

5. An isolated nucleic acid sequence coding for any one of the amino acid 
sequences of Claim 3 or 4. 

6. A purified antibody which binds specifically to any of the amino acid 
sequences of Claim 3 or 4. 
7. An expression vector comprising any one of the nucleic acid sequences of 
Claim 1 or 5 and control elements for the expression of the nucleic acid sequence in 
a suitable host. 

8. An expression vector comprising any one of the nucleic acid sequences of 
Claim 2, and control elements for the expression of the nucleic acid sequences in a 
suitable host. 

9. A host cell transfected by the expression vector of Claim 7 or 8. 

10. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the expression vector of Claim 7; and 

(ii) any one of the amino acid sequences of Claim 3 or 4. 

11. A pharmaceutical composition according to Claim 10, for treatment of 
diseases which can be ameliorated or cured by raising the level of any one of the 
amino acid sequences depicted in SEQ ID NO: 18 to SEQ ID NO: 34. 

12. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) any one of the nucleic acid sequences of Claim 2; 

(ii) the expression vector of Claim 8; and 

(iii) the purified antibody of Claim 6. 

13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can be ameliorated or cured by decreasing the level of any one of 
the amino acid sequences depicted in SEQ ID NO: 18 to SEQ ID NO: 34. 

14. A method for detecting a variant nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1 or 2; and 

(b) detecting said hybridization complex; 

wherein the presence of said hybridization complex correlates with the 
presence of a variant nucleic acid sequence in the said biological sample. 

15. A method for determining the level of variant nucleic acid sequences in a 
biological sample comprising the steps of: 

(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1 or 2; and 

(b) determining the amount of hybridization complexes and normalizing 
said amount to provide the level of the variant nucleic acid sequences in the 
sample. 

16. A method for determining the ratio between the level of variant of the 
nucleic acid sequence in a first biological sample and the level of the original 
sequence from which the variant has been varied by alternative splicing in a second 
biological sample comprising: 

(a) determining the level of the variant nucleic acid sequence in the first 
biological sample according to the method of Claim 15; 

(b) determining the level of the original sequence in the second biological 
sample; and 

(c) comparing the levels obtained in (a) and (b) to give said ratio. 

17. A method according to Claim 16, wherein said first and said second 
biological samples are the same sample. 

18. A method according to any of Claims 14 to 17, wherein the nucleic acid 
material of said biological sample are mRNA transcripts. 

19. A method according to Claim 18, where the nucleic acid sequence is present 
in a nucleic acid chip. 

20. A method for identifying candidate compounds capable of binding to the 
variant product and modulating its activity, the method comprising: 

(i) providing any one of the amino acid sequences as defined in Claim 3 
or 4; 

(ii) contacting a candidate compound with said amino acid sequence; 

(iii) determining the effect of said candidate compound on the biological 
activity of said protein or polypeptide and selecting those compounds which show a 
significant effect on said biological activity. 

21. A method according to Claim 20, wherein the compound is an activator and 
the measured effect is increase in the biological activity. 

22. A method according to Claim 20, wherein the compound is a deactivator 
and the effect is decrease in the biological activity. 

23. An activator of any one of the amino acid sequences of Claim 3 or 4. 

24. A deactivator of any one of the amino acid sequences of Claims 3 or 4. 

25. A method for detecting any one of the amino acid sequences of Claim 3 or 4 
in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 6, 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the desired amino acid in said biological sample. 

26. A method for detecting the level of the amino acid sequence of any one of 
Claim 3 or 4 in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 6, 
thereby forming an antibody-antigen complex; and 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the level of said amino acid sequence in the 
sample. 

27. A method for determining the ratio between the level of any one of the 
amino acid sequences of Claims 3 or 4 present in a first biological sample and the 
level of the original amino acid sequences from which they were varied by 
alternative splicing, present in a second biological sample, the method comprising: 

(a) determining the level of the amino acid sequences of Claims 3 or 4 
in a first sample by the method of Claim 26; 

(b) determining the level of the original amino acid sequence in the 
second sample; and 

(c) comparing the level obtained in (a) and (b) to give said ratio. 

28. A method according to Claim 27, wherein said first and said second 
biological samples are the same sample. 

29. A method for detecting any one of the antibodies of Claim 6 in a biological 
sample comprising the steps of: 

(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 3 or 4 thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of the antibody in said biological sample. 

30. A method for detecting the level of any one of the antibodies of Claim 6 in a 
biological sample comprising the steps of: 

(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 3; 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the levels of said antibody in the sample. 
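
The fragment condition of Claim 1(iii) can be illustrated with a short sketch. 
It assumes plain string comparison on one strand (no reverse complements, no 
ambiguity codes) and uses made-up exon sequences; the function name and all 
example data are hypothetical. A fragment that occurs in the variant but not, 
as a whole, in the original necessarily contains a stretch that is absent from 
the original as a continuous run, so a substring test is sufficient here. 

```python
def qualifies_as_fragment(fragment: str, variant: str, original: str,
                          min_len: int = 20) -> bool:
    """Check the length and novelty conditions on a candidate fragment.

    True only if the fragment is at least `min_len` nucleotides long, occurs
    in the variant sequence, and does not occur as a continuous stretch in
    the original sequence.
    """
    f, v, o = fragment.lower(), variant.lower(), original.lower()
    if len(f) < min_len:
        return False
    if f not in v:
        return False
    return f not in o


if __name__ == "__main__":
    # Hypothetical exons: the variant skips exon 2 of the original.
    exon1 = "ACGTACGTACGTACGTACGT"
    exon2 = "TTTTTTTTTTGGGGGGGGGG"
    exon3 = "CATGCATGCATGCATGCATG"
    original = exon1 + exon2 + exon3
    variant = exon1 + exon3
    junction = exon1[-10:] + exon3[:10]   # spans the new exon 1 / exon 3 boundary
    print(qualifies_as_fragment(junction, variant, original))
    print(qualifies_as_fragment(exon1, variant, original))
```

A fragment spanning the new splice junction passes, while a fragment lying 
wholly within a shared exon does not. 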
SEQUENCE LISTING 



<110> COMPUGEN LTD 

<120> ALTERNATIVE SPLICE VARIANTS 

<130> COMPUGEN6 

<140> 
<141> 

<160> 34 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 1530 
<212> DNA 
<213> HUMAN 



<400> 1 
caggccttga ggttttggca gctctggagg atgagagaga acatggccag gggcccttgc 60 
aacgcgccga gatgggtgtc cctgatggtg ctcgtcgcca taggcaccgc cgtgacagcg 120 
gccgtcaacc ctggcgtcgt ggtcaggatc tcccagaagg gcctggacta cgccagccag 180 
caggggacgg ccgctctgca gaaggagctg aagaggatca agattcctga ctactcagac 240 
agctttaaga tcaagcatct tgggaagggg cattatagct tctacagcat ggacatccgt 300 
gaattccagc ttcccagttc ccagataagc atggtgccca atgtgggcct taagttctcc 360 
atcagcaacg ccaatatcaa gatcagcggg aaatggaagg cacaaaagag attcttaaaa 420 
atgagcggca attttgacct gagcatagaa ggcatgtcca tttcggctga tctgaagctg 480 
ggcagtaacc ccacgtcagg caagcccacc atcacctgct ccagctgcag cagccacatc 540 
aacagtgtcc acgtgcacat ctcaaagagc aaagtcgggt ggctgatcca actcttccac 600 
aaaaaaattg agtctgcgct tcgaaacaag atgaacagcc aggtctgcga gaaagtgacc 660 
aattctgtat cctccaagct gcaaccttat ttccagactc tgccagtaat gaccaaaata 720 
gattctgtgg ctggaatcaa ctatggtctg gtggcacctc cagcaaccac ggctgagacc 780 
ctggatgtac agatgaaggg ggagttttac agtgagaacc accacaatcc acctcccttt 840 
gctccaccag tgatggagtt tcccgctgcc catgaccgca tggtatacct gggcctctca 900 
gactacttct tcaacacagc cgggcttgta taccaagagg ctggggtctt gaagatgacc 960 
cttagagatg acatgattcc aaaggagtcc aaatttcgac tgacaaccaa gttctttgga 1020 
accttcctac ctgaggtggc caagaagttt cccaacatga agatacagat ccatgtctca 1080 
gcctccaccc cgccacacct gtctgtgcag cccaccggcc ttaccttcta ccctgccgtg 1140 
gatgtccagg cctttgccgt cctccccaac tcctccctgg cttccctctt cctgattggc 1200 
atggtaagca gttcctgggt tggacagatg aggagcccca gacagtccca acagcactgt 1260 
ctttggagtc aggagaccat gtgaatcctg tctggattca aacctggact gtgtcactcc 1320 
ggagcctgag gcttgagtca ctgtactcaa tggtgccgac tcctggaggt attcattcac 1380 
ccagccattc actagtgcgt ttgtttactt attcattcaa ttattcattc agtcaatttc 1440 
tcattcattc anttattcat tccatgttgg cttgaaatat gtgtactgtn ccaattnatc 1500 
catttatatc tttagtcatt caattatgca 1530 



<210> 2 
<211> 2251 
<212> DNA 
<213> HUMAN 

<400> 2 

ttttttcagg ggtatgaggc tttattcagg aagagctcaa gatgggaggg gagcccctgg 60 
ggtcccactc ttgtggcctc ttgataccaa ggctcatggg acgctaaaac ctaatgactg 120 
caggcatctg agcttccttg gagtcatctg tggggggcac aggggcagcc tcagagtcac 180 
ccgtgggggg cacagggggg gccccggagt cacccgtggg cggcacgggg gcggtctcgg 240 
agtcacccgt agcggggatg gggcagcctc agtgtcaccc aaggtgggca cgggcctcag 300 
agtcacccgt gggcggcacg gggggggccc cggagtcacc cgtgggcggc acgggggggg 360 
ccccggagtc acccgtgggc ggcacggggg gggccccgga gtcacccgtg ggcggcacgg 420 
gggcggtctc ggagtcaccc gtggggggca cgggagtggc ctcggagtcc cctgtggggg 480 
gcacaggggt ggcctcctgg tcggtcactg tgggcagcgc cagataggtg aggytccagt 540 
agcgcaggaa gttggttctc aggctccgct tcatggagct gctgcccatc ttcttggtga 600 
tctccaggta gccgctgttt tccgtagtgt agggttccca gtgtgtgggc acagccgagt 660 
cgcccatgtt ggggtcccct gttttggcaa agttggtcca gtaggcgatc atggccttag 720 
agactgtcct gtcttggggc cggtagcccg tgggggtggc gaagggcttc ccgaaaacgt 780 
actgaatgtc atctgcatgg tcggccccca cccatttggg gtagacgggc atccgagagg 840 
gatgggaaaa caggtaggcg taggtcttgg cactcttggc attggctctg tgctgggcta 900 
gggcaatctc ggtgggcacc aggaagagga catcggtctc aaagtccacc acagtcttct 960 
tcttattctc ctgggatggg tcctgggccc aggactcggt gtagacatca aaggtcgtct 1020 
tggcgcctct gagccccttg gtgattgtga actcactgac cagcttgtag aagtcctcct 1080 
ccgtgacttt cttgttgccc ttgttgatgg caggcatgtc gatgctggcg aagatgtggc 1140 
cgtccatgtt gttggtgcct gctatatagt cgatgtcggc ggcgttggcg tacaggttga 1200 
tcgggtcagc ggggatgaag tctccatcaa tgacagggac gaagcccaca tagtgcagca 1260 
tggggtactc caggcctgcc agcggcacct tataggccag cgtcagggct cggggatcag 1320 
taaccttcag acactgggcc atcctggcgg catcacccac agggcaaccc accttctcag 1380 
ccaccttttt ggcccagaag agtgggtttt tctggatgac ccagggactc agggccacgc 1440 
cgctctggct gatggctcgc cggatgaggc ccttgttgta gggggagagg gtctgcagag 1500 
agacgctggc acctccagca gactccccga agagcgtgat gttgttgggg tcccccccga 1560 
aggccgcgat attcctcttc acccaagcaa tggccatgtg ctgatcccga agaccatagt 1620 
tacctggcag attggcgtcc ccagtgctga ggaacccaag ggggccgaca cggtagttga 1680 
aggtgaccac gatgacgttt ccgcgtgtgg cgatctcctc gccgtcatac aggtagttgt 1740 
tgaggaagtt ggccccatgg ccggacccca tgaggaaggc gcctccatag atccagatca 1800 
taacgggcag gtcccgggag acttgcttcc tgccctgggg cacccaaatg ttgaggtaca 1860 
ggcagtcttc atccccgtag gtgctgtcct gggtgatggt ggcctgcagg catctcttct 1920 
tgaagttctt ggccttcagg gtcccttgcc agccaggatg tggctgagga ttttccaggg 1980 
ccttggtggg agctgcgaag gggatgccct tgaagatgtc cacagagtca cccaggaggc 2040 
cgagcttctt attgacgcct tccacgaacc caccttctgt gtacacggcg cccagcttcg 2100 
cggcactcgc cactgcccag cagcaggtga ggcccaacac aaccagttgc aggcgcccca 2160 
tggtgagcat cagcctctgg atgaataaac tctttgcatt atcttgagac tgctgttcct 2220 
tccatcttga cacttaactg atacagaagg t 2251 



<210> 3 
<211> 2078 
<212> DNA 
<213> HUMAN 



<400> 3 
ctgctgctcc tggggttcct gctggtgagc ttggagtcaa cactttcgat tccaccttgg 60 
gaagccccca aggagcataa gtacaaagct gaagagcaca cagtcgttct cactgtcacc 120 
ggggagccct gccacttccc cttccagtac caccggcagc tgtaccacaa atgtacccac 180 
aagggccggc caggccctca gccctggtgt gctaccaccc ccaactttga tcaggaccag 240 
cgatggggat actgtttgga gcccaagaaa gtgaaagacc actgcagcaa acacagcccc 300 
tgccagaaag gagggacctg tgtgaacatg ccaagcggcc cccactgtct ctgtccacaa 360 
cacctcactg gaaaccactg ccagaaagag aagtgctttg agcctcagct tctccggttt 420 
ttccacaaga atgagatatg gtatagaact gagcaagcag ctgtggccag atgccagtgc 480 
aagggtcctg atgcccactg ccagcggctg gccagccagg cctgccgcac caacccgtgc 540 
ctccatgggg gtcgctgcct agaggtggag ggccaccgcc tgtgccactg cccggtgggc 600 
tacaccggac ccttctgcga cgtggacacc aaggcaagct gctatgatgg ccgcgggctc 660 
agctaccgcg gcctggccag gaccacgctc tcgggtgcgc cctgtcagcc gtgggcctcg 720 
gaggccacct accggaacgt gactgccgag caagcgcgga actggggact gggcggccac 780 
gccttctgcc ggaacccgga caacgacatc cgcccgtggt gcttcgtgct gaaccgcgac 840 
cggctgagct gggagtactg cgacctggca cagtgccaga ccccaaccca ggcggcgcct 900 
ccgaccccgg tgtcccctag gcttcatgtc ccactcatgc ccgcgcagcc ggcaccgccg 960 
aagcctcagc ccacgacccg gaccccgcct cagtcccaga ccccgggagc cttgccggcg 1020 
aagcgggagc agccgccttc cctgaccagg aacggcccac tgagctgcgg gcagcggctc 1080 
cgcaagagtc tgtcttcgat gacccgcgtc gttggcgggc tggtggcgct acgcggggcg 1140 
cacccctaca tcgccgcgct gtactggggc cacagtttct gcgccggcag cctcatcgcc 1200 
ccctgctggg tgctgacggc cgctcactgc ctgcaggacc ggcccgcacc cgaggatctg 1260 



acggtggtgc tcggccagga acgccgtaac cacagctgtg agccgtgcca gacgttggcc 1320 

gtgcgctcct accgcttgca cgaggccttc tcgcccgtca gctaccagca cgacctggct 1380 

ctgttgcgcc ttcaggagga tgcggacggc agctgcgcgc tcctgtcgcc ttacgttcag 1440 

ccggtgtgcc tgccaagcgg cgccgcgcga ccctccgaga ccacgctctg ccaggtggcc 1500 

ggctggggcc accagttcga gggggcggag gaatatgcca gcttcctgca ggaggcgcag 1560 

gtaccgttcc tctccctgga gcgctgctca gccccggacg tgcacggatc ctccatcctc 1620 

cccggcatgc tctgcgcagg gttcctcgag ggcggcaccg atgcgtgcag gtgagctctt 1680 

agccggttgg cgcccttccc cgaggccgtc agcawaatct caggtccaca gcgctgactg 1740 

cgtgtttccg acccagggtg attccggagg cccgctggtg tgtgaggacc aagctgcaga 1800 

gcgccggctc accctgcaag gcatcatcag ctggggatcg ggctgtggtg accgcaacaa 1860 

gccaggcgtc tacaccgatg tggcctacta cctggcctgg atccgggagc acaccgtttc 1920 

ctgattgctc agggactcat ctttccctcc ttggtgattc cgcagtgaga gagtggctgg 1980 

ggcatggaag gcaagattgt gtcccattcc cccagtgcgg ccagctccgc gccaggatgg 2040 

cgcaggaact caataaagtg ctttgaaaat gctgagaa 2078 

<210> 4 

<211> 720 

<212> DNA 

<213> HUMAN 

<400> 4 

tcgaccgccc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 

tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 

ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 

cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 

gcccccagca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 

gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 

cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 

gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 

tcccacaacc gcagcgagga gtttctcatt gctggaaaac tgcaggatgg actcttgcac 540 

atcactacct gcagttttgt ggctccctgg aacagcctga gcttagctca gcgccggggc 600 

ttcaccaaga cctacactgt tggctgtgag gaatgcacag tgtttccctg ttcccactcc 660 

catctttctt ccggacaatg aaataaagag ttaccaccca gcagaaaaaa caaacaagtc 720 



<210> 5 
<211> 556 
<212> DNA 
<213> HUMAN 



<400> 5 
tcgaccgccc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 
tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 
cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 
gcccccagca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 
gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 
cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 
gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 
tcccacaacc gcagcgagga gtttctcatt gctggaaaac tgcaggtagt gatgtgcaag 540 
agtccatccg tcgtac 556 



<210> 6 
<211> 934 
<212> DNA 
<213> HUMAN 



<400> 6 

tcgaccgccc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 



tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 

ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 

cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 

gcccccagca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 

gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 

cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 

gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 

tcccacaacc gcagcgagga gtttctcatt tgctgggaaa attgcaggat gggatttttg 540 

cacattcact aacctgcagt ttttgttggg ttccttggga acagcctgag cttagctcag 600 

cgccggggct tcaccaagac ctacactgtt ggctgtgagg aatgcacagt gtttccctgt 660 

ttatccatcc cctgcaaact gcagagtggc actcattgct tgtggacgga ccagctcctc 720 

caaggctctg aaaagggctt ccagtcccgt caccttgcct gcctgcctcg ggagccaggg 780 

ctgtgcacct ggcagtccct gcggtcccag atagcctgaa tcctgcccgg agtggaagct 840 

gaagcctgca cagtgtccac cctgttccca ctcccatctt tcttccggac aatgaaataa 900 

agagttacca cccagcagaa aaaacaaaca agtc 934 



<210> 7 
<211> 783 
<212> DNA 
<213> HUMAN 



<400> 7 
tcgaccgccc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 
tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 
ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 
cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 
gcccccagca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 
gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 
cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 
gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 
tcccacaacc gcagcgagga gtttctcatt tatccatccc ctgcaaactg cagagtggca 540 
ctcattgctt gtggacggac cagctcctcc aaggctctga aaagggcttc cagtcccgtc 600 
accttgcctg cctgcctcgg gagccagggc tgtgcacctg gcagtccctg cggtcccaga 660 
tagcctgaat cctgcccgga gtggaagctg aagcctgcac agtgtccacc ctgttcccac 720 
tcccatcttt cttccggaca atgaaataaa gagttaccac ccagcagaaa aaacaaacaa 780 
gtc 783 



<210> 8 
<211> 895 
<212> DNA 
<213> HUMAN 

<400> 8 

tcgaccgccc cttggcttct gcactgatgg tgggtggatg agtaatgcat ccaggaagcc 60 

tggaggcctg tggtttccgc acccgctgcc acccccgccc ctagcgtgga catttatcct 120 

ctagcgctca ggccctgcgc gcatcgccgc agatccagcg cccagagaga caccagagaa 180 

cccaccatgg ccccctttga gcccctggct tctggcatcc tgttgttgct gtggctgata 240 

gcccccagca gggcctgcac ctgtgtccca ccccacccac agacggcctt ctgcaattcc 300 

gacctcgtca tcagggccaa gttcgtgggg acaccagaag tcaaccagac caccttatac 360 

cagcgttatg agatcaagat gaccaagatg tataaagggt tccaagcctt aggggatgcc 420 

gctgacatcc ggttcgtcta cacccccgcc atggagagtg tctgcggata cttccacagg 480 

gctggaaaac tgcaggatgg actcttgcac atcactacct gcagttttgt ggctccctgg 540 

aacagcctga gcttagctca gcgccggggc ttcaccaaga cctacactgt tggctgtgag 600 

gaatgcacag tgtttccctg tttatccatc ccctgcaaac tgcagagtgg cactcattgc 660 

ttgtggacgg accagctcct ccaaggctct gaaaagggct tccagtcccg tcaccttgcc 720 

tgcctgcctc gggagccagg gctgtgcacc tggcagtccc tgcggtccca gatagcctga 780 

atcctgcccg gagtggaagc tgaagcctgc acagtgtcca ccctgttccc actcccatct 840 

ttcttccgga caatgaaata aagagttacc acccagcaga aaaaacaaac aagtc 895 






<210> 9 

<211> 2565 

<212> DNA 

<213> HUMAN 



<400> 9 

ggcacgagca gccccggagc ccgggccagg gtccacctgt ccccgcagcg ccggctcgcg 60
ccctcctgcc gcagccaccg agccgccgtc tagcgccccg acctcgccac catgagagcc 120
ctgctggcgc gcctgcttct ctgcgtcctg gtcgtgagcg actccaaagg cagcaatgaa 180
cttcatcaag ttccatcgaa ctgtgactgt ctaaatggag gaacatgtgt gtccaacaag 240
tacttctcca acattcactg gtgcaactgc ccaaagaaat tcggagggca gcactgtgaa 300
atagataagt caaaaacctg ctatgagggg aatggtcact tttaccgagg aaaggccagc 360
actgacacca tgggccggcc ctgcctgccc tggaactctg ccactgtcct tcagcaaacg 420
taccatgccc acagatctga tgctcttcag ctgggcctgg ggaaacataa ttactgcagg 480
gaggtggggg cacaaggacc aaaagccctc cctacagttc ccagaaacct tgttaccatc 540
cccttctccc agagggctgg ccatagcaca agagaagtgc agcctctggt tgagtcttcc 600
ctgaggggag gaggcaggga aggccctctg ggttggaatg acatccccta tctttctgtg 660
ttgccaggaa cccagacaac cggaggcgac cctggtgcta tgtgcaggtg ggcctaaagc 720
cgcttgtcca agagtgcatg gtgcatgact gcgcagatgg aaaaaagccc tcctctcctc 780
cagaagaatt aaaatttcag tgtggccaaa agactctgag gccccgcttt aagattattg 840
ggggagaatt caccaccatc gagaaccagc cctggtttgc ggccatctac aggaggcacc 900
gggggggctc tgtcacctac gtgtgtggag gcagcctcat cagcccttgc tgggtgatca 960
gcgccacaca ctgcttcatt gattacccaa agaaggagga ctacatcgtc tacctgggtc 1020
gctcaaggct taactccaac acgcaagggg agatgaagtt tgaggtggaa aacctcatcc 1080
tacacaagga ctacagcgct gacacgcttg ctcaccacaa cgacattgcc ttgctgaaga 1140
tccgttccaa ggagggcagg tgtgcgcagc catcccggac tatacagacc atctgcctgc 1200
cctcgatgta taacgatccc cagtttggca caagctgtga gatcactggc tttggaaaag 1260
agaattctac cgactatctc tatccggagc agctgaaaat gactgttgtg aagctgattt 1320
cccaccggga gtgtcagcag ccccactact acggctctga agtcaccacc aaaatgctgt 1380
gtgctgctga cccacagtgg aaaacagatt cctgccaggg agactcaggg ggacccctcg 1440
tctgttccct ccaaggccgc atgactttga ctggaattgt gagctggggc cgtggatgtg 1500
ccctgaagga caagccaggc gtctacacga gagtctcaca cttcttaccc tggatccgca 1560
gtcacaccaa ggaagagaat ggcctggccc tctgagggtc cccagggagg aaacgggcac 1620
cacccgcttt cttgctggtt gtcatttttg cagtagagtc atctccatca gctgtaagaa 1680
gagactggga agataggctc tgcacagatg gatttgcctg tgccaccacc agggygaacg 1740
acaatagctt taccctcagg cataggcctg ggtgctggct gcccagaccc ctctggccag 1800
gatggagggg tggtcctgac tcaacatgtt actgaccagc aacttgtctt tttctggact 1860
gaagcctgca ggagttaaaa agggcagggc atctcctgtg catgggtgaa gggagagcca 1920
gctcccccga cggtgggcat ttgtgaggcc catggttgag aaatgaataa tttcccaatt 1980
aggaagtgta acagctgagg tctcttgagg gagcttagcc aatgtgggag cagcggtttg 2040
gggagcagag acactaacga cttcagggca gggctctgat attccatgaa tgtatcagga 2100
aatatatatg tgtgtgtatg tttgcacact tgtgtgtggg ctgtgagtgt aagtgtgagt 2160
aagagctggt gtctgattgt taagtctaaa tatttcctta aactgtgtgg actgtgatgc 2220
cacacagagt ggtctttctg gagaggttat aggtcactcc tggggcctct tgggtccccc 2280
acgtgacagt gcctgggaat gtattattct gcagcatgac ctgtgaccag cactgtctca 2340
gtttcacttt cacatagatg tccctttctt ggccagttat cccttccttt tagcctagtt 2400
catccaatcc tcactgggtg gggtgaggac cactccttac actgaatatt tatatttcac 2460
tatttttatt tatatttttg taattttaaa taaaagtgat caataaaatg tgatttttct 2520
gatgamaaaa aaaaaagggc ggccgctcta gagtatccct cgagg 2565



<210> 10 
<211> 2438 
<212> DNA 
<213> HUMAN 

<400> 10 

ggggcacagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc cacacgcccc 60 

cctccagcag ctgaattcct gcagctcagc agccgccgcc agagcaggac gaaccgccaa 120 

tcgcaaggca cctctgagaa cttcaggatg cagatgtctc cagccctcac ctgcctagtc 180 






ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc atcctacgtg 240

gcccacctgg cctcagactt cggggtgagg gtgtttcagc aggtggcgca ggcctccaag 300

gaccgcaacg tggttttctc accctatggg gtggcctcgg tgttggccat gctccagctg 360

acaacaggag gagaaaccca gcagcagatt caagcagcta tgggattcaa gattgatgac 420

aagggcatgg cccccgccct ccggcatctg tacaaggagc tcatggggcc atggaacaag 480

gatgagatca gcaccacaga cgcgatcttc gtccagcggg atctgaagct ggtccagggc 540

ttcatgcccc acttcttcag gctgttccgg agcacggtca agcaagtgga cttttcagag 600

gtggagagag ccagattcat catcaatgac tgggtgaaga cacacacaaa aggtatgatc 660

agcaacttgc ttgggaaagg agccgtggac cagctgacac ggctggtgct ggtgaatgcc 720

ctctacttca acggccagtg gaagactccc ttccccgact ccagcaccca ccgccgcctc 780

ttccacaaat cagacggcag cactgtctct gtgcccatga tggctcagac caacaagttc 840

aactatactg agttcaccac gcccgatggc cattactacg acatcctgga actgccctac 900

cacggggaca ccctcagcat gttcattgct gcagacctgg ttcccactga ggccctttgc 960

aggatggaac tacggggctt acaggagctt ttgtgtgcct ggtagaaact atttctgttc 1020

cagtcacatt gccatcactc ttgtactgcc tgccaccgcg gaggaggctg gtgacaggcc 1080

aaaggccagt ggaagaaaca ccctttcatc tcagagtcca ctgtggcact ggccacccct 1140

ccccagtaca ggggtgctgc aggtggcaga gtgaatgtcc cccatcatgt ggcccaactc 1200

tcctggcctg gccatctccc tccccagaaa cagtgtgcat gggttatttt ggagtgtagg 1260

tgacttgttt actcattgaa gcagatttct gcttcctttt atttttatag gaatagagga 1320

agaaaggtca gatgcgtgcc cagctcttca ccccccaatc tcttggtggg gaggggtgta 1380

cctaaatatt tatcatatcc ttgcccttga gtgcttgtta gagagaaaga gaactactaa 1440

ggaaaataat attatttaaa ctcgctccta gtgtttcttt gtggtctgtg tcaccgtatc 1500

tcaggaagtc cagccacttg actggcacac acccctccgg acatccagcg tgacggagcc 1560

cacactgcca ccttgtggcc gcctgagacc ctcgcgcccc ccgcgccccc cgcgcccctc 1620

tttttcccct tgatggaaat tgaccataca atttcatcct ccttcagggg atcaaaagga 1680

cggagtgggg ggacagagac tcagatgagg acagagtggt ttccaatgtg ttcaatagat 1740

ttaggagcag aaatgcaagg ggctgcatga cctaccagga cagaactttc cccaattaca 1800

gggtgactca cagccgcatt ggtgactcac ttcaatgtgt catttccggc tgctgtgtgt 1860

gagcagtgga cacgtgaggg gggggtgggt gagagagaca ggcagctcgg attcaactac 1920

cttagataat atttctgaaa acctaccagc cagagggtag ggcacaaaga tggatgtaat 1980

gcactttggg aggccaaggc gggaggattg cttgagccca ggagttcaag accagcctgg 2040

gcaacatacc aagacccccg tctctttaaa aatatatata ttttaaatat acttaaatat 2100

atatttctaa tatctttaaa tatatatata tattttaaag accaatttat gggagaattg 2160

cacacagatg tgaaatgaat gtaatctaat agaagcctaa tcagcccacc atgttctcca 2220

ctgaaaaatc ctctttcttt ggggtttttc tttctttctt ttttgatttt gcactggacg 2280

gtgacgtcag ccatgtacag gatccacagg ggtggtgtca aatgctattg aaattgtgtt 2340

gaattgtatg ctttttcact tttgataaat aaacatgtaa aaatgtttca aaaaaataat 2400

aaaataaata aatacgaaga atatgtcagg acaaaaaa 2438



<210> 11 
<211> 2522 
<212> DNA 
<213> HUMAN 



<400> 11 

ggggcacagc tgtgtttggc tgcagggcca agagcgctgt caagaagacc cacacgcccc 60
cctccagcag ctgaattcct gcagctcagc agccgccgcc agagcaggac gaaccgccaa 120
tcgcaaggca cctctgagaa cttcaggatg cagatgtctc cagccctcac ctgcctagtc 180
ctgggcctgg cccttgtctt tggtgaaggg tctgctgtgc accatccccc atcctacgtg 240
gcccacctgg cctcagactt cggggtgagg gtgtttcagc aggtggcgca ggcctccaag 300
gaccgcaacg tggttttctc accctatggg gtggcctcgg tgttggccat gctccagctg 360
acaacaggag gagaaaccca gcagcagatt caagcagcta tgggattcaa gattgatgac 420
aagggcatgg cccccgccct ccggcatctg tacaaggagc tcatggggcc atggaacaag 480
gatgagatca gcaccacaga cgcgatcttc gtccagcggg atctgaagct ggtccagggc 540
ttcatgcccc acttcttcag gctgttccgg agcacggtca agcaagtgga cttttcagag 600
gtggagagag ccagattcat catcaatgac tgggtgaaga cacacacaaa aggtatgatc 660
agcaacttgc ttgggaaagg agccgtggac cagctgacac ggctggtgct ggtgaatgcc 720
ctctacttca acggccagtg gaagactccc ttccccgact ccagcaccca ccgccgcctc 780
ttccacaaat cagacggcag cactgtctct gtgcccatga tggctcagac caacaagttc 840
aactatactg agttcaccac gcccgatggc cattactacg acatcctgga actgccctac 900






cacggggaca ccctcagcat gttcattgct gccccttatg aaaaagaggt gcctctctct 960
gccctcacca acattctgag tgcccagctc atcagccact ggaaaggcaa catgaccagg 1020
ctgccccgcc tcctggttct gcccaagttc tccctggaga ctgaagtcga cctcaggaag 1080
cccctagaga acctgggaat gaccgacatg ttcagacagt ttcaggctga cttcacgagt 1140
ctttcagacc aagagcctct ccacgtcgcg caggcgctgc agaaagtgaa gatcgaggtg 1200
aacgagagtg gcacggtggc ctcctcatcc acagctgtca tagtctcagc ccgcatggcc 1260
cccgaggaga tcatcatgga cagacccttc ctctttgtgg tccctcccca gaaacagtgt 1320
gcatgggtta ttttggagtg taggtgactt gtttactcat tgaagcagat ttctgcttcc 1380
ttttattttt ataggaatag aggaagaaag gtcagatgcg tgcccagctc ttcacccccc 1440
aatctcttgg tggggagggg tgtacctaaa tatttatcat atccttgccc ttgagtgctt 1500
gttagagaga aagagaacta ctaaggaaaa taatattatt taaactcgct cctagtgttt 1560
ctttgtggtc tgtgtcaccg tatctcagga agtccagcca cttgactggc acacacccct 1620
ccggacatcc agcgtgacgg agcccacact gccaccttgt ggccgcctga gaccctcgcg 1680
ccccccgcgc cccccgcgcc cctctttttc cccttgatgg aaattgacca tacaatttca 1740
tcctccttca ggggatcaaa aggacggagt ggggggacag agactcagat gaggacagag 1800
tggtttccaa tgtgttcaat agatttagga gcagaaatgc aaggggctgc atgacctacc 1860
aggacagaac tttccccaat tacagggtga ctcacagccg cattggtgac tcacttcaat 1920
gtgtcatttc cggctgctgt gtgtgagcag tggacacgtg aggggggggt gggtgagaga 1980
gacaggcagc tcggattcaa ctaccttaga taatatttct gaaaacctac cagccagagg 2040
gtagggcaca aagatggatg taatgcactt tgggaggcca aggcgggagg attgcttgag 2100
cccaggagtt caagaccagc ctgggcaaca taccaagacc cccgtctctt taaaaatata 2160
tatattttaa atatacttaa atatatattt ctaatatctt taaatatata tatatatttt 2220
aaagaccaat ttatgggaga attgcacaca gatgtgaaat gaatgtaatc taatagaagc 2280
ctaatcagcc caccatgttc tccactgaaa aatcctcttt ctttggggtt tttctttctt 2340
tcttttttga ttttgcactg gacggtgacg tcagccatgt acaggatcca caggggtggt 2400
gtcaaatgct attgaaattg tgttgaattg tatgcttttt cacttttgat aaataaacat 2460
gtaaaaatgt ttcaaaaaaa taataaaata aataaatacg aagaatatgt caggacaaaa 2520
aa 2522



<210> 12 
<211> 2272 
<212> DNA 
<213> HUMAN 

<400> 12 

ctcttccccg ctgagaggsa cagccastgc gwctccwccc tccagctcga cggcagccgc 60 

ccggccgaca gccccgagac gacagcccgg cgcgtcccgg tccccacctc cgaccaccgc 120 

cagcgctcca ggccccgcgc tccccgctcg ccgccaccgc gccctccgct ccgcccgcag 180 

tgccaaccat gaccgccgcc agtatgggcc ccgtccgcgt cgccttcgtg gtcctcctcg 240 

ccctctgcag ccggccggcc gtcggccaga actgcagcgg gccgtgccgg tgcccggacg 300 

agccggcgcc gcgctgcccg gcgggcgtga gcctcgtgct ggacggctgc ggctgctgcc 360 

gcgtctgcgc caagcagctg ggcgagctgt gcaccgagcg cgacccctgc gacccgcaca 420 

agggcctctt ctgtgacttc ggctccccgg ccaaccgcaa gatcggcgtg tgcaccgcca 480 

aagatggtgc tccctgcatc ttcggtggta cggtgtaccg cagcggagag tccttccaga 540 

gcagctgcaa gtaccagtgc acgtgcctgg acggggcggt gggctgcatg cccctgtgca 600 

gcatggacgt tcgtctgccc agccctgact gccccttacc gactggaaga cacgtttggc 660 

ccagacccaa ctatgattag agccaactgc ctggtccaga ccacagagtg gagcgcctgt 720 

tccaagacct gtgggatggg catctccacc cgggttacca atgacaacgc ctcctgcagg 780 

ctagagaagc agagccgcct gtgcatggtc aggccttgcg aagtgacctg gaagagaaca 840 

ttaagaaggg caaaaagtgc atccgtactc ccaaaatctc caagcctatc aagtttgagc 900 

tttctggctg caccagcatg aagacatacc gagctaaatt ctgtggagta tgtaccgacg 960 

gccgatgctg caccccccac agaaccacca ccctgccggt ggagttcaag tgccctgacg 1020 

gcgaggtcat gaagaagaac atgatgttca tcaagacctg tgcctgccat tacaactgtc 1080 

ccggagacaa tgacatcttt gaatcgctgt actacaggaa gatgtacgga gacatggcat 1140 

gaagccagag agtgagagac attaactcat tagactggaa cttgaactga ttcacatctc 1200

atttttccgt aaaaatgatt tcagtagcac aagttattta aatctgtttt tctaactggg 1260 

ggaaaagatt cccacccaat tcaaaacatt gtgccatgtc aaacaaatag tctatcaacc 1320 

ccagacactg gtttgaagaa tgttaagact tgacagtgga actacattag tacacagcac 1380 

cagaatgtat attaaggtgt ggctttagga gcagtgggag ggtaccagca gaaaggttag 1440 

tatcatcaga tagcatctta tacgagtaat atgcctgcta tttgaagtgt aattgagaag 1500 
gaaaatttta gcgtgctcac tgacctgcct gtagccccag tgacagctag gatgtgcatt 1560

ctccagccat caagagactg agtcaagttg ttccttaagt cagaacagca gactcagctc 1620

tgacattctg attcgaatga cactgttcag gaatcggaat cctgtcgatt agactggaca 1680

gcttgtggca agtgaatttg cctgtaacaa gccagatttt ttaaaattta tattgtaaat 1740

attgtgtgtg tgtgtgtgtg tgtatatata tatatatgta cagttatcta agttaattta 1800

aagttgtttg tgccttttta tttttgtttt taatgctttg atatttcaat gttagcctca 1860

atttctgaac accataggta gaatgtaaag cttgtctgat cgttcaaagc atgaaatgga 1920

tacttatatg gaaattctgc tcagatagaa tgacagtccg tcaaaacaga ttgtttgcaa 1980

aggggaggca tcagtgtcct tggcaggctg atttctaggt aggaaatgtg gtagcctcac 2040

ttttaatgaa caaatggcct ttattaaaaa ctgagtgact ctatatagct gatcagtttt 2100

ttcacctgga agcatttgtt tctactttga tatgactgtt tttcggacag tttatttgtt 2160

gagagtgtga ccaaaagtta catgtttgca cctttctagt tgaaaataaa gtgtatattt 2220

tttctataaa aaatgtcgac cggccgctaa tttagtagta gtaggcggca gc 2272



<210> 13 
<211> 1153 
<212> DNA 
<213> HUMAN 



<400> 13 

tgaaatatgg agaatagtct tagatgtgtt tgggtaccca agctggcttt tgtactcttc 60
ggagcttcct tgctcagcgc gcatcttcaa gtaaccggtt ttcaaattaa agctttcaca 120
gcactgcgct tcctctcaga accttctgat gccgtcacaa tgcggggagg aaatgtcctc 180
ctcgactgct ccgcggagtc cgaccgagga gttccagtga tcaagtggaa gaaagatgca 240
ttcatctggc cttgggaatg gatgaaagga agcagcaact ttcaaatggg tctctgctga 300
tacaaaacat acttcattcc agacaccaca agccagatga gggactttac caatgtgagg 360
catctttagg agattctggc tcaattatta gtcggacagc aaaagttgca gtagcaggac 420
cactgaggtt cctttcacag acagaatctg tcacagcctt catgggagac acagtgctac 480
tcaagtgtga agtcattggg gagcccatgc caacaatcca ctggcagaag aaccaacaag 540
acctgactcc aatcccaggt gactcccgag tggtggtctt gccctctgga gcattgcaga 600
tcagccgact ccaaccgggg gacattggaa tttaccgatg ctcagctcga aatccagcca 660
gctcaagaac aggaaatgaa gcagaagtca gaattttatc agatccagga ctgcatagac 720
agctgtattt tctgcaaaga ccatccaatg tagtagccat tgaaggaaaa gatgctgtcc 780
tggaatgttg tgtttctggc tatcctccac caagttttac ctggttacga ggcgaggaag 840
tcatccaact caggtctaaa aagtattctt tattgggtgg aagcaacttg cttatctcca 900
atgtgacaga tgatgacagt ggaatgtata cctgtgttgt cacatataaa aatgagaata 960
ttagtgcctc tgcagagctc acagtcttgg taatcatcga taaagtattg gtggacacct 1020
tttgggtata aggattgcac taggttctgg gaagactaca gaaacatctg aaacctggct 1080
tcagtgggct tcgtataatt acaattcgta aaatcagttg gccwaatact gtctgaataa 1140
acttttwwtt aaa 1153



<210> 14 
<211> 4459 
<212> DNA 
<213> HUMAN 



<400> 14 

tgaaatatgg agaatagtct tagatgtgtt tgggtaccca agctggcttt tgtactcttc 60
ggagcttcct tgctcagcgc gcatcttcaa gtaaccggtt ttcaaattaa agctttcaca 120
gcactgcgct tcctctcaga accttctgat gccgtcacaa tgcggggagg aaatgtcctc 180
ctcgactgct ccgcggagtc cgaccgagga gttccagtga tcaagtggaa gaaagatgca 240
ttcatctggc cttgggaatg gatgaaagga agcagcaact ttcaaatggg tctctgctga 300
tacaaaacat acttcattcc agacaccaca agccagatga gggactttac caatgtgagg 360
catctttagg agattctggc tcaattatta gtcggacagc aaaagttgca gtagcaggac 420
cactgaggtt cctttcacag acagaatctg tcacagcctt catgggagac acagtgctac 480
tcaagtgtga agtcattggg gagcccatgc caacaatcca ctggcagaag aaccaacaag 540
acctgactcc aatcccaggt gactcccgag tggtggtctt gccctctgga gcattgcaga 600
tcagccgact ccaaccgggg gacattggaa tttaccgatg ctcagctcga aatccagcca 660
gctcaagaac aggaaatgaa gcagaagtca gaattttatc agatccagga ctgcatagac 720
agctgtattt tctgcaaaga ccatccaatg tagtagccat tgaaggaaaa gatgctgtcc 780
tggaatgttg tgtttctggc tatcctccac caagttttac ctggttacga ggcgaggaag 840
tcatccaact caggtctaaa aagtattctt tattgggtgg aagcaacttg cttatctcca 900
atgtgacaga tgatgacagt ggaatgtata cctgtgttgt cacatataaa aatgagaata 960
ttagtgcctc tgcagagctc acagtcttgg ttccgccatg gtttttaaat catccttcca 1020
acctgtatgc ctatgaaagc atggatattg agtttgaatg tacagtctct ggaaagcctg 1080
tgcccactgt gaattggatg aagaatggag atgtggtcat tcctagtgat tattttcaga 1140
tagtgggagg aagcaactta cggatacttg gggtggtgaa gtcagatgaa ggcttttatc 1200
aatgtgtggc tgaaaatgag gctggaaatg cccagaccag tgcacagctc attgtcccta 1260
agcctgcaat cccaagctcc agtgtcctcc cttcggctcc cagagatgtg gtccctgtct 1320
tggtttccag ccgatttgtc cgtctcagct ggcgcccacc tgcagaagcg aaagggaaca 1380
ttcaaacttt cacggtcttt ttctccagag aaggtgacaa cagggaacga gcattgaata 1440
caacacagcc tgggtccctt cagctcactg tgggaaacct gaagccagaa gccatgtaca 1500
cctttcgagt tgtggcttac aatgaatggg gaccgggaga gagttctcaa cccatcaagg 1560
tggccacaca gcctgagttg caagttccag ggccagtaga aaacctgcaa gctgtatcta 1620
cctcacctac ctcaattctt attacctggg aaccccctgc ctatgcaaac ggtccagtcc 1680
aaggttacag attgttctgc actgaggtgt ccacaggaaa agaacagaat atagaggttg 1740
atggactatc ttataaactg gaaggcctga aaaaattcac cgaatatagt cttcgattct 1800
tagcttataa tcgctatggt ccgggcgtct ctactgatga tataacagtg gttacacttt 1860
ctgacgtgcc aagtgccccg cctcagaacg tctccctgga agtggtcaat tcaagaagta 1920
tcaaagttag ctggctgcct cctccatcag gaacacaaaa tggatttatt accggctata 1980
aaattcgaca cagaaagacg acccgcaggg gtgagatgga aacactggag ccaaacaacc 2040
tctggtacct attcacagga ctggagaaag gaagtcagta cagtttccag gtgtcagcca 2100
tgacagtcaa tggtactgga ccaccttcca actggtatac tgcagagact ccagagaatg 2160
atctagatga atctcaagtt cctgatcaac caagctctct tcatgtgagg ccccagacta 2220
actgcatcat catgagttgg actcctccct tgaacccaaa catcgtggtg cgaggttata 2280
ttatcggtta tggcgttggg agcccttacg ctgagacagt gcgtgtggac agcaagcagc 2340
gatattattc cattgagagg ttagagtcaa gttcccatta tgtaatctcc ctaaaagctt 2400
ttaacaatgc cggagaagga gttcctcttt atgaaagtgc caccaccagg tctataaccg 2460
atcccactga cccagttgat tattatcctt tgcttgatga tttccccacc tcggtcccag 2520
atctctccac ccccatgctc ccaccagtag gtgtacaggc tgtggctctt acccatgatg 2580
ctgtgagggt cagctgggca gacaactctg tccctaagaa ccaaaagacg tctgaggtgc 2640
gactttacac cgtccggtgg agaaccagct tttctgcaag tgcaaaatac aagtcagaag 2700
acacaacatc tctaagttac acagcaacag gcctcaaacc aaacacaatg tatgaattct 2760
cggtcatggt aacaaaaaac agaaggtcca gtacttggag catgactgca catgccacca 2820
cgtatgaagc agcccccacc tctgctccca aggactttac agtcattact agggaaggga 2880
agcctcgtgc cgtcattgtg agttggcagc ctcccttgga agccaatggg aaaattactg 2940
cttacatctt attttatacc ttggacaaga acatcccaat tgatgactgg attatggaaa 3000
caatcagtgg tgataggctt actcatcaaa tcatggatct caaccttgat actatgtatt 3060
actttcgaat tcaagcacga aattcaaaag gagtggggcc actctctgat cccatcctct 3120
tcaggactct gaaagtggaa caccctgaca aaatggctaa tgaccaaggt cgtcatggag 3180
atggaggtta ttggccagtt gatactaatt tgattgatag aagcacccta aatgagccgc 3240
caattggaca aatgcacccc ccgcatggca gtgtcactcc tcagaagaac agcaacctgc 3300
ttgtgatcat tgtggtcacc gttggtgtca tcacagtgct ggtagtggtc atcgtggctg 3360
tgatttgcac ccgacgctct tcagcccagc agagaaagaa acgggccacc cacagtgctg 3420
gcaaaaggaa gggcagccag aaggacctcc gaccccctga tctttggatc catcatgaag 3480
aaatggagat gaaaaatatt gaaaagccat ctggcactga ccctgcagga agggactctc 3540
ccatccaaag ttgccaagac ctcacaccag tcagccacag ccagtcagaa acccaactgg 3600
gaagcaaaag cacctctcat tcaggtcaag acactgagga agcagggagc tctatgtcca 3660
ctctggagag gtcgctggct gcacgccgag ccccccgggc caagctcatg attcccatgg 3720
atgcccagtc caacaatcct gctgtcgtga gcgccatccc ggtgccaacg ctagaaagtg 3780
cccagtaccc aggaatcctc ccgtctccca cctgtggata tccccacccg cagttcactc 3840
tccggcctgt gccattccca acactctcag tggaccgagg tttcggagca ggaagaagtc 3900
agtcagtgag tgaaggacca actacccaac aaccacctat gctgccccca tctcagcctg 3960
agcattctag cagcgaggag gcaccaagca gaaccatccc cacagcttgt gttcgaccaa 4020
ctcacccact ccgcagcttt gctaatcctt tgctacctcc accaatgagt gcaatagaac 4080
cgaaagtccc ttacacacca cttttgtctc agccagggcc cactcttcct aagacccatg 4140
tgaaaacagc ctcccttggg ttggctggaa aagcaagatc ccctttgctt cctgtgtctg 4200
tgccaacagc ccctgaagtg tctgaggaga gccacaaacc aacagaggat tcagccaatg 4260
taagtgcatc tttaaaattc atgcttcatc aagggacaga ttgactggcg ctgtgtaata 4320
gcatctaaac caatgagtac tgtccattca ccccaggcca ccccaggcca ccccagtcca 4380
ttgttgttct tattgttgtt tgttcctttg ttttcctgca cttaaatatg aagaggaaat 4440
taaatctatg atattaaaa 4459



<210> 15 
<211> 3744 
<212> DNA 
<213> HUMAN 

<400> 15 

ccgagtgaca aggaggtggg agagggtagc agcatgggct acgcggttgg ctgccctcag 60 
tccccctgct gctgaagctg ccctgcccat gcccacccag gccgtggggc caggggcctg 120 
ccagggctag gagtgggcct gccgttcatg ggtctctagg gatttccgar atgcctggga 180 
agagaggctt gggctggtgg tgggcccggc tgcccctttg cctgctcctc agcctttacg 240 
gcccctggat gccttcctcc ctgggaaagc ccaaaggcca ccctcacatg aattccatcc 300
gcatagatgg ggacatcaca ctgggaggcc tgttcccggt gcatggccgg ggctcagagg 360 
gcaagccctg tggagaactt aagaaggaaa agggcatcca ccggctggag gccatgctgt 420 
tcgccctgga tcgcatcaac aacgacccgg acctgctgcc taacatcacg ctgggcgccc 480 
gcattctgga cacctgctcc agggacaccc atgccctcga gcagtcgctg acctttgtgc 540 
aggcgctcat cgagaaggat ggcacagagg tccgctgtgg cagtggcggc ccacccatca 600 
tcaccaagcc tgaacgtgtg gtgggtgtca tcggtgcttc agggagctcg gtctccatca 660 
tggtggccaa catccttcgc ctcttcaaga taccccagat cagctacgcc tccacagcgc 720 
cagacctgag tgacaacagc cgctacgact tcttctcccg cgtggtgccc tcggacacgt 780
accaggccca ggccatggtg gacatcgtcc gtgccctcaa gtggaactat gtgtccacag 840 
tggcctcgga gggcagctat ggtgagagcg gtgtggaggc cttcatccag aagtcccgtg 900 
aggacggggg cgtgtgcatc gcccagtcgg tgaagatacc acgggagccc aaggcaggcg 960 
agttcgacaa gatcatccgc cgcctcctgg agacttcgaa cgccagggca gtcatcatct 1020 
ttgccaacga ggatgacatc aggcgtgtgc tggaggcagc acgaagggcc aaccagacag 1080
gccatttctt ctggatgggc tctgacagct ggggctccaa gattgcacct gtgctgcacc 1140 
tggaggaggt ggctgagggt gctgtcacga tcctccccaa gaggatgtcc gtacgagacc 1200 
gtgagcgaat tgggcaggat tcagcttatg agcaggaggg gaaggtgcag tttgtgatcg 1260 
atgccgtgta cgccatgggc cacgcgctgc acgccatgca ccgtgacctg tgtcccggcc 1320
gcgtggggct ctgcccgcgc atggaccctg tagatggcac ccagctgctt aagtacatcc 1380 
gaaacgtcaa cttctcaggc atcgcaggga accctgtgac cttcaatgag aatggagatg 1440 
cgcctgggcg ctatgacatc taccaatacc agctgcgcaa cgattctgcc gagtacaagg 1500
tcattggctc ctggactgac cacctgcacc ttagaataga gcggatgcac tggccgggga 1560 
gcgggcagca gctgccccgc tccatctgca gcctgccctg ccaaccgggt gagcggaaga 1620 
agacagtgaa gggcatgcct tgctgctggc actgcgagcc ttgcacaggg taccagtacc 1680 
aggtggaccg ctacacctgt aagacgtgtc cctatgacat gcggcccaca gagaaccgca 1740 
cgggctgccg gcccatcccc atcatcaagc ttgagtgggg ctcgccctgg gccgtgctgc 1800 
ccctcttcct ggccgtggtg ggcatcgctg ccacgttgtt cgtggtgatc acctttgtgc 1860 
gctacaacga cacgcccatc gtcaaggcct cgggccgtga actgagctac gtgctgctgg 1920 
caggcatctt cctgtgctat gccaccacct tcctcatgat cgctgagccc gaccttggca 1980 
cctgctcgct gcgccgaatc ttcctgggac tagggatgag catcagctat gcagccctgc 2040 
tcaccaagac caaccgcatc taccgcatct tcgagcaggg caagcgctcg gtcagtgccc 2100
cacgcttcat cagccccgcc tcacagctgg ccatcacctt cagcctcatc tcgctgcagc 2160 
tgctgggcat ctgtgtgtgg tttgtggtgg acccctccca ctcggtggtg gacttccagg 2220 
accagcggac actcgacccc cgcttcgcca ggggtgtgct caagtgtgac atctcggacc 2280 
tgtcgctcat ctgcctgctg ggctacagca tgctgctcat ggtcacgtgc accgtgtatg 2340 
ccatcaagac acgcggcgtg cccgagacct tcaatgaggc caagcccatt ggcttcacca 2400 
tgtacaccac ttgcatcgtc tggctggcct tcatccccat cttctttggc acctcgcagt 2460 
cggccgacaa gctgtacatc cagacgacga cgctgacggt ctcggtgagt ctgagcgcct 2520 
cggtgtccct gggaatgctc tacatgccca aagtctacat catcctcttc cacccggagc 2580 
agaacgtgcc caagcgcaag cgcagcctca aagccgtcgt tacggcggcc accatgtcca 2640 
acaagttcac gcagaagggc aacttccggc ccaacggaga ggccaagtct gagctctgcg 2700 
agaaccttga ggccccagcg ctggccacca aacagactta cgtcacttac accaaccatg 2760 
caatctagcg agtccatgga gctgagcagc aggaggagga gccgtgaccc tgtggaaggt 2820 
gcgtcgggcc agggccacac ccaagggccc agctgtcttg cctgcccgtg ggcacccacg 2880
gacgtggctt ggtgctgagg atagcagagc ccccagccat cactgctggc agcctgggca 2940 
aaccgggtga gcaacaggag gacgaggggc cggggcggtg ccaggctacc acaagaacct 3000 
gcgtcttgga ccattgcccc tcccggcccc aaaccacagg ggctcaggtc gtgtgggccc 3060 
cagtgctaga tctctccctc ccttcgtctc tgtctgtgct gttggcgacc cctctgtctg 3120

tctccagccc tgtctttctg ttctcttatc tctttgtttc accttttccc tctctggcgt 3180 

ccccgyctgc ttgtactctt ggccttttct gtgtctcctt tctggctctt gcctccgcct 3240 

ctctctctca tcctctttgt cctcagctcc tcctgctttc ttgggtccca ccagtgtcac 3300 

ttttctgccg ttttctttcc tgttctcctc tgcttcattc tcgtccagcc attgctcccc 3360

tctccctgcc acccttcccc agttcaccaa accttacatg ttgcaaaaga gaaaaaagga 3420 

aaaaaaatca aaacacaaaa aagccaaaac gaaaacaaat ctcgagtgtg ttgccaagtg 3480 

ctgcgtcctc ctggtggcct ctgtgtgtgt ccctgtggcc cgcagcctgc ccgcctgccc 3540 

cgcccatctg ccgtgtgtct tgcccgcctg ccccgcccgt ctgccgtctg tcttgcccgc 3600 

ctgcccgcct gcccctcctg ccgaccacac ggagttcagt gcctgggtgt ttggtgatgg 3660 

ttattgacga caatgtgtag cgcatgattg tttttatacc aagaacattt ctaataaaaa 3720 

taaacacatg gttttgcaaa aaaa 3744 



<210> 16 
<211> 2899 
<212> DNA 
<213> HUMAN 



<400> 16 

cgcgggagcc aacttcaggc tgctcagagg aagcccgtgc agtcagtcac ctgggtgcaa 60

gagcgttgct gcctcgggct ctcccgctgc agggagagcg gcactcgctg gcctggatgt 120

ggttggattt aggggggctc cgcagcaggg gtttcgtggc ggtggcaagc gctgcaacag 180

gtagacggcg agagacggac cccggccgag gcagggatgg agaccaaagg ctaccacagt 240

ctccctgaag gtctagatat ggaaagacgg tggggtcaag tttctcaggc tgtggagcgt 300

tcttccctgg gacctacaga gaggaccgat gagaataact acatggagat tgtcaacgta 360

agctgtgttt ccggtgctat tccaaacaac agtactcaag gaagcagcaa agaaaaacaa 420

gaactactcc cttgccttca gcaagacaat aatcggcctg ggattttaac atctgatatt 480

aaaactgagc tggaatctaa ggaactttca gcaactgtag ctgagtccat gggtttatat 540

atggattctg taagagatgc tgactattcc tatgagcagc agaaccaaca aggaagcatg 600

agtccagcta agatttatca gaatgttgaa cagctggtga aattttacaa aggaaatggc 660

catcgtcctt ccactctaag ttgtgtgaac acgcccttga gatcatttat gtctgactct 720

gggagctccg tgaatggtgg cgtcatgcgc gccattgtta aaagccctat catgtgtcat 780

gagaaaagcc cgtctgtttg cagccctctg aacatgacat cttcggtttg cagccctgct 840

ggaatcaact ctgtgtcctc caccacagcc agctttggca gttttccagt gcacagccca 900

atcacccagg gaactcctct gacatgctcc cctaatgctg aaaatcgagg ctccaggtcg 960

cacagccctg cacatgctag caatgtgggc tctcctctct caagtccgtt aagtagcatg 1020

aaatcctcaa tttccagccc tccaagtcac tgcagtgtaa aatctccagt ctccagtccc 1080

aataatgtca ctctgagatc ctctgtgtct agccctgcaa atattaacaa ctcaaggtgc 1140

tctgtttcca gcccttcgaa cactaataac agatccacgc tttccagtcc ggcagccagt 1200

actgtgggat ctatctgtag ccctgtaaac aatgccttca gctacactgc ttctggcacc 1260

tctgctggat ccagtacatt gcgggatgtg gttcccagtc cagacacgca ggagaaaggt 1320

gctcaagagg tcccttttcc taagactgag gaagtagaga gtgccatctc aaatggtgtg 1380

actggccagc ttaatattgt ccagtacata aaaccagaac cagatggagc ttttagcagc 1440

tcatgtctag gaggaaatag caaaataaat tcggattctt cattctcagt accaataaag 1500

caagaatcaa ccaagcattc atgttcaggc acctctttta aagggaatcc aacagtaaac 1560

ccgtttccat ttatggatgg ctcgtatttt tcctttatgg atgataaaga ctattattcc 1620

ctatcaggaa ttttaggacc acctgtgccc ggctttgatg gtaactgtga aggcagcgga 1680

ttcccagtgg gtattaaaca agaaccagat gacgggagct attacccaga ggccagcatc 1740

ccttcctctg ctattgttgg ggtgaattca ggtggacagt ccttccacta caggattggt 1800

gctcaaggta caatatcttt atcacgatcg gctagagacc aatctttcca acacctgagt 1860

tcctttcctc ctgtcaatac tttagtggag tcatggaaat cacacggcga cctgtcgtct 1920

agaagaagtg atgggtatcc ggtcttagaa tacattccag aaaatgtatc aagctctact 1980

ttacgaagtg tttctactgg atcttcaaga ccttcaaaaa tatgtttggt gtgtggggat 2040

gaggcttcag gatgccatta tggggtagtc acctgtggca gctgcaaagt tttcttcaaa 2100

agagcagtgg aagggcaaca caactattta tgtgctggaa gaaatgattg catcattgat 2160

aagattcgac gaaagaattg tcctgcttgc agacttcaga aatgtcttca agctggaatg 2220

aatttaggag cacgaaagtc aaagaagttg ggaaagttaa aagggattca cgaggagcag 2280

ccacagcagc agcagccccc acccccaccc ccacccccgc aaagcccaga ggaagggaca 2340

acgtacatcg ctcctgcaaa agaaccctcg gtcaacacag cactggttcc tcagctctcc 2400

acaatctcac gagcgctcac accttccccc gttatggtcc ttgaaaacat tgaacctgaa 2460
attgtatatg caggctatga cagctcaaaa ccagatacag ccgaaaatct gctctccacg 2520

ctcaaccgct tagcaggcaa acagatgatc caagtcgtga agtgggcaaa ggtacttcca 2580 

ggatttaaaa acttgcctct tgaggaccaa attaccctaa tccagtattc ttggatgtgt 2640 

ctatcatcat ttgccttgag ctggagatcg tacaaacata cgaacagcca atttctctat 2700 

tttgcaccag acctagtctt taatgaactg ttggcgagag tgagagaggg ttgaagtctt 2760 

agtggcactt cacatgcctc tgtggattct ctgagtgtct ctctgggatg aaggtgactt 2820 

ccacccacac catcttgcgt ctgaagcact agttgtgtga gggcagctcc attagacgag 2880 

tacacaactc agttgcaaa 2899 



<210> 17 

<211> 2780 

<212> DNA 

<213> HUMAN 

<400> 17 

cgcgggagcc aacttcaggc tgctcagagg aagcccgtgc agtcagtcac ctgggtgcaa 60
gagcgttgct gcctcgggct ctcccgctgc agggagagcg gcactcgctg gcctggatgt 120
ggttggattt aggggggctc cgcagcaggg gtttcgtggc ggtggcaagc gctgcaacag 180
gtagacggcg agagacggac cccggccgag gcagggatgg agaccaaagg ctaccacagt 240
ctccctgaag gtctagatat ggaaagacgg tggggtcaag tttctcaggc tgtggagcgt 300
tcttccctgg gacctacaga gaggaccgat gagaataact acatggagat tgtcaacgta 360
agctgtgttt ccggtgctat tccaaacaac agtactcaag gaagcagcaa agaaaaacaa 420
gaactactcc cttgccttca gcaagacaat aatcggcctg ggattttaac atctgatatt 480
aaaactgagc tggaatctaa ggaactttca gcaactgtag ctgagtccat gggtttatat 540
atggattctg taagagatgc tgactattcc tatgagcagc agaaccaaca aggaagcatg 600
agtccagcta agatttatca gaatgttgaa cagctggtga aattttacaa aggaaatggc 660
catcgtcctt ccactctaag ttgtgtgaac acgcccttga gatcatttat gtctgactct 720
gggagctccg tgaatggtgg cgtcatgcgc gccattgtta aaagccctat catgtgtcat 780
gagaaaagcc cgtctgtttg cagccctctg aacatgacat cttcggtttg cagccctgct 840
ggaatcaact ctgtgtcctc caccacagcc agctttggca gttttccagt gcacagccca 900
atcacccagg gaactcctct gacatgctcc cctaatgctg aaaatcgagg ctccaggtcg 960
cacagccctg cacatgctag caatgtgggc tctcctctct caagtccgtt aagtagcatg 1020
aaatcctcaa tttccagccc tccaagtcac tgcagtgtaa aatctccagt ctccagtccc 1080
aataatgtca ctctgagatc ctctgtgtct agccctgcaa atattaacaa ctcaaggtgc 1140
tctgtttcca gcccttcgaa cactaataac agatccacgc tttccagtcc ggcagccagt 1200
actgtgggat ctatctgtag ccctgtaaac aatgccttca gctacactgc ttctggcacc 1260
tctgctggat ccagtacatt gcgggatgtg gttcccagtc cagacacgca ggagaaaggt 1320
gctcaagagg tcccttttcc taagactgag gaagtagaga gtgccatctc aaatggtgtg 1380
actggccagc ttaatattgt ccagtacata aaaccagaac cagatggagc ttttagcagc 1440
tcatgtctag gaggaaatag caaaataaat tcggattctt cattctcagt accaataaag 1500
caagaatcaa ccaagcattc atgttcaggc acctctttta aagggaatcc aacagtaaac 1560
ccgtttccat ttatggatgg ctcgtatttt tcctttatgg atgataaaga ctattattcc 1620
ctatcaggaa ttttaggacc acctgtgccc ggctttgatg gtaactgtga aggcagcgga 1680
ttcccagtgg gtattaaaca agaaccagat gacgggagct attacccaga ggccagcatc 1740
ccttcctctg ctattgttgg ggtgaattca ggtggacagt ccttccacta caggattggt 1800
gctcaaggta caatatcttt atcacgatcg gctagagacc aatctttcca acacctgagt 1860
tcctttcctc ctgtcaatac tttagtggag tcatggaaat cacacggcga cctgtcgtct 1920
agaagaagtg atgggtatcc ggtcttagaa tacattccag aaaatgtatc aagctctact 1980
ttacgaagtg tttctactgg atcttcaaga ccttcaaaaa tatgtttggt gtgtggggat 2040
gaggcttcag gatgccatta tggggtagtc acctgtggca gctgcaaagt tttcttcaaa 2100
agagcagtgg aagggcaaca caactattta tgtgctggaa gaaatgattg catcattgat 2160
aagattcgac gaaagaattg tcctgcttgc agacttcaga aatgtcttca agctggaatg 2220
aatttaggag cacgaaagtc aaagaagttg ggaaagttaa aagggattca cgaggagcag 2280
ccacagcagc agcagccccc acccccaccc ccacccccgc aaagcccaga ggaagggaca 2340
acgtacatcg ctcctgcaaa agaaccctcg gtcaacacag cactggttcc tcagctctcc 2400
acaatctcac gagcgctcac accttccccc gttatggtcc ttgaaaacat tgaacctgaa 2460
attgtatatg caggctatga cagctcaaaa ccagatacag ccgaaaatct gctctccacg 2520
ctcaaccgct tagcaggcaa acagatgatc caagtcgtga agtgggcaaa ggtacttcca 2580
ggatttaaaa acttgcctct tgaggaccaa attaccctaa tccagtattc ttggatgtgt 2640
ctatcatcat ttgccttgag ctggagatcg tacaaacata cgaacagcca atttctctat 2700
tttgcaccag acctagtctt taatgaataa gtacctatta attagccttc ataaaataaa 2760
ctgcagagtt atgcgttaaa 2780



<210> 18 
<211> 481 
<212> PRT 
<213> HUMAN 

<400> 18 

Met Arg Glu Asn Met Ala Arg Gly Pro Cys Asn Ala Pro Arg Trp Val 
15 10 15 

Ser Leu Met Val Leu Val Ala lie Gly Thr Ala Val Thr Ala Ala Val 
20 25 30 

Asn Pro Gly Val Val Val Arg lie Ser Gin Lys Gly Leu Asp Tyr Ala 
35 40 * 45 

Ser Gin Gin Gly Thr Ala Ala Leu Gin Lys Glu Leu Lys Arg lie Lys ■ 
50 55 60 

lie Pro Asp Tyr Ser Asp Ser Phe Lys lie Lys His Leu Gly Lys Gly 
65 70 75 80 

His Tyr Ser Phe Tyr Ser Met Asp lie Arg Glu Phe Gin Leu Pro Ser 
85 90 95 

Ser Gin lie Ser Met Val Pro Asn Val Gly Leu Lys Phe Ser lie Ser 
100 105 110 

Asn Ala Asn lie Lys lie Ser Gly Lys Trp Lys Ala Gin Lys Arg Phe 
115 120 125 

Leu Lys Met Ser Gly Asn Phe Asp Leu Ser lie Glu Gly Met Ser lie 
130 135 140 

Ser Ala Asp Leu Lys Leu Gly Ser Asn Pro Thr Ser Gly Lys Pro Thr 
145 - 150 155 160 

lie Thr Cys Ser Ser Cys Ser Ser His lie Asn Ser Val His Val His 
165 170 175 

lie Ser Lys Ser Lys Val Gly Trp Leu lie Gin Leu Phe His Lys Lys 
180 185 190 

lie Glu Ser Ala Leu Arg Asn Lys Met Asn Ser Gin Val Cys Glu Lys 
195 200 205 

Val Thr Asn Ser Val Ser Ser Lys Leu Gin Pro Tyr Phe Gin Thr Leu 
210 215 220 

Pro Val Met Thr Lys lie Asp Ser Val Ala Gly lie Asn Tyr Gly Leu 
225 230 235 240 

Val Ala Pro Pro Ala Thr Thr Ala Glu Thr Leu Asp Val Gin Met Lys 
245 250 255 

Gly Glu Phe Tyr Ser Glu Asn His His Asn Pro Pro Pro Phe Ala Pro 
260 265 270

Pro Val Met Glu Phe Pro Ala Ala His Asp Arg Met Val Tyr Leu Gly
275 280 285

Leu Ser Asp Tyr Phe Phe Asn Thr Ala Gly Leu Val Tyr Gln Glu Ala
290 295 300

Gly Val Leu Lys Met Thr Leu Arg Asp Asp Met Ile Pro Lys Glu Ser
305 310 315 320

Lys Phe Arg Leu Thr Thr Lys Phe Phe Gly Thr Phe Leu Pro Glu Val
325 330 335

Ala Lys Lys Phe Pro Asn Met Lys Ile Gln Ile His Val Ser Ala Ser
340 345 350

Thr Pro Pro His Leu Ser Val Gln Pro Thr Gly Leu Thr Phe Tyr Pro
355 360 365

Ala Val Asp Val Gln Ala Phe Ala Val Leu Pro Asn Ser Ser Leu Ala
370 375 380

Ser Leu Phe Leu Ile Gly Met Gly Lys Gln Phe Leu Gly Trp Thr Asp
385 390 395 400

Glu Glu Pro Gln Thr Val Pro Thr Ala Leu Ser Leu Glu Ser Gly Asp
405 410 415

His Val Asn Pro Val Trp Ile Gln Thr Trp Thr Val Ser Leu Arg Ser
420 425 430

Leu Arg Leu Glu Ser Leu Tyr Ser Met Val Pro Thr Pro Gly Gly Ile
435 440 445

His Ser Pro Ser His Ser Leu Val Arg Leu Phe Thr Tyr Ser Phe Asn
450 455 460

Tyr Ser Phe Ser Gln Phe Leu Ile His Ser Xaa Ile His Ser Met Leu
465 470 475 480

Ala



<210> 19 

<211> 628 

<212> PRT 

<213> HUMAN 

<400> 19 

Met Gly Arg Leu Gln Leu Val Val Leu Gly Leu Thr Cys Cys Trp Ala
1 5 10 15

Val Ala Ser Ala Ala Lys Leu Gly Ala Val Tyr Thr Glu Gly Gly Phe
20 25 30

Val Glu Gly Val Asn Lys Lys Leu Gly Leu Leu Gly Asp Ser Val Asp
35 40 45

Ile Phe Lys Gly Ile Pro Phe Ala Ala Pro Thr Lys Ala Leu Glu Asn
50 55 60

Pro Gln Pro His Pro Gly Trp Gln Gly Thr Leu Lys Ala Lys Asn Phe
65 70 75 80

Lys Lys Arg Cys Leu Gin Ala Thr lie Thr Gin Asp Ser Thr Tyr Gly 
85 90 95 

Asp Glu Asp Cys Leu Tyr Leu Asn lie Trp Val Pro Gin Gly Arg Lys 
100 105 110 

Gin Val Ser Arg Asp Leu Pro Val Met lie Trp He Tyr Gly Gly Ala 
115 120 125 

Phe Leu Met Gly Ser Gly His Gly Ala Asn Phe Leu Asn Asn Tyr Leu 
130 135 140 

Tyr Asp Gly Glu Glu He Ala Thr Arg Gly Asn Val He Val Val Thr 
145 150 155 160 

Phe Asn Tyr Arg Val Gly Pro Leu Gly Phe Leu Ser Thr Gly Asp Ala 
165 170 175 

Asn Leu Pro Gly Asn Tyr Gly Leu Arg Asp Gln His Met Ala Ile Ala
180 185 190

Trp Val Lys Arg Asn Ile Ala Ala Phe Gly Gly Asp Pro Asn Asn Ile
195 200 205

Thr Leu Phe Gly Glu Ser Ala Gly Gly Ala Ser Val Ser Leu Gin Thr 
210 215 220 

Leu Ser Pro Tyr Asn Lys Gly Leu He Arg Arg Ala lie Ser Gin Ser 
225 230 235 240 

Gly Val Ala Leu Ser Pro Trp Val He Gin Lys Asn Pro Leu Phe Trp 
245 250 255 

Ala Lys Lys Val Ala Glu Lys Val Gly Cys Pro Val Gly Asp Ala Ala 
260 265 270 

Arg Met Ala Gin Cys Leu Lys Val Thr Asp Pro Arg Ala Leu Thr Leu 
275 280 285 

Ala Tyr Lys Val Pro Leu Ala Gly Leu Glu Tyr Pro Met Leu His Tyr 
290 295 300 

Val Gly Phe Val Pro Val He Asp Gly Asp Phe He Pro Ala Asp Pro 
305 310 315 320 

He Asn Leu Tyr Ala Asn Ala Ala Asp He Asp Tyr He Ala Gly Thr 
325 330 335 

Asn Asn Met Asp Gly His Ile Phe Ala Ser Ile Asp Met Pro Ala Ile
340 345 350

Asn Lys Gly Asn Lys Lys Val Thr Glu Glu Asp Phe Tyr Lys Leu Val
355 360 365

Ser Glu Phe Thr Ile Thr Lys Gly Leu Arg Gly Ala Lys Thr Thr Phe
370 375 380

Asp Val Tyr Thr Glu Ser Trp Ala Gln Asp Pro Ser Gln Glu Asn Lys
385 390 395 400



Lys Lys Thr Val Val Asp Phe Glu Thr Asp Val Leu Phe Leu Val Pro 
405 410 415 

Thr Glu Ile Ala Leu Ala Gln His Arg Ala Asn Ala Lys Ser Ala Lys
420 425 430 

Thr Tyr Ala Tyr Leu Phe Ser His Pro Ser Arg Met Pro Val Tyr Pro 
435 440 445 

Lys Trp Val Gly Ala Asp His Ala Asp Asp lie Gin Tyr Val Phe Gly 
450 455 460 

Lys Pro Phe Ala Thr Pro Thr Gly Tyr Arg Pro Gin Asp Arg Thr Val 
465 470 475 480 

Ser Lys Ala Met Ile Ala Tyr Trp Thr Asn Phe Ala Lys Thr Gly Asp
485 490 495

Pro Asn Met Gly Asp Ser Ala Val Pro Thr His Trp Glu Pro Tyr Thr
500 505 510

Thr Glu Asn Ser Gly Tyr Leu Glu Ile Thr Lys Lys Met Gly Ser Ser
515 520 525 

Ser Met Lys Arg Ser Leu Arg Thr Asn Phe Leu Arg Tyr Trp Thr Leu 
530 535 540 

Thr Tyr Leu Ala Leu Pro Thr Val Thr Asp Gin Glu Ala Thr Pro Val 
545 550 555 560 

Pro Pro Thr Gly Asp Ser Glu Ala Thr Pro Val Pro Pro Thr Gly Asp 
565 570 575 

Ser Glu Thr Ala Pro Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro 
580 585 590 

Val Pro Pro Thr Gly Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly 
595 600 605 

Asp Ser Gly Ala Pro Pro Val Pro Pro Thr Gly Asp Ser Glu Ala Arg 
610 615 620 

Ala His Leu Gly 
625 



<210> 20 
<211> 641 
<212> PRT 
<213> HUMAN 

<400> 20 

Leu Leu Leu Leu Gly Phe Leu Leu Val Ser Leu Glu Ser Thr Leu Ser
1 5 10 15

Ile Pro Pro Trp Glu Ala Pro Lys Glu His Lys Tyr Lys Ala Glu Glu
20 25 30

His Thr Val Val Leu Thr Val Thr Gly Glu Pro Cys His Phe Pro Phe
35 40 45



Gin Tyr His Arg Gin Leu Tyr His Lys Cys Thr His Lys Gly Arg Pro 
50 55 60 

Gly Pro Gin Pro Trp Cys Ala Thr Thr Pro Asn Phe Asp Gin Asp Gin 
65 70 75 80 

Arg Trp Gly Tyr Cys Leu Glu Pro Lys Lys Val Lys Asp His Cys Ser 
85 90 95 

Lys His Ser Pro Cys Gln Lys Gly Gly Thr Cys Val Asn Met Pro Ser
100 105 110

Gly Pro His Cys Leu Cys Pro Gln His Leu Thr Gly Asn His Cys Gln
115 120 125

Lys Glu Lys Cys Phe Glu Pro Gln Leu Leu Arg Phe Phe His Lys Asn
130 135 140

Glu lie Trp Tyr Arg Thr Glu Gin Ala Ala Val Ala Arg Cys Gin Cys 
145 150 155 160 

Lys Gly Pro Asp Ala His Cys Gin Arg Leu Ala Ser Gin Ala Cys Arg 
165 170 175 

Thr Asn Pro Cys Leu His Gly Gly Arg Cys Leu Glu Val Glu Gly His 
180 185 190

Arg Leu Cys His Cys Pro Val Gly Tyr Thr Gly Pro Phe Cys Asp Val 
195 200 205 

Asp Thr Lys Ala Ser Cys Tyr Asp Gly Arg Gly Leu Ser Tyr Arg Gly 
210 215 220 

Leu Ala Arg Thr Thr Leu Ser Gly Ala Pro Cys Gin Pro Trp Ala Ser 
225 230 235 240 

Glu Ala Thr Tyr Arg Asn Val Thr Ala Glu Gin Ala Arg Asn Trp Gly 
245 250 255 

Leu Gly Gly His Ala Phe Cys Arg Asn Pro Asp Asn Asp lie Arg Pro 
260 265 270 

Trp Cys Phe Val Leu Asn Arg Asp Arg Leu Ser Trp Glu Tyr Cys Asp 
275 280 285

Leu Ala Gin Cys Gin Thr Pro Thr Gin Ala Ala Pro Pro Thr Pro Val 
290 295 300 

Ser Pro Arg Leu His Val Pro Leu Met Pro Ala Gln Pro Ala Pro Pro
305 310 315 320

Lys Pro Gln Pro Thr Thr Arg Thr Pro Pro Gln Ser Gln Thr Pro Gly
325 330 335

Ala Leu Pro Ala Lys Arg Glu Gln Pro Pro Ser Leu Thr Arg Asn Gly
340 345 350

Pro Leu Ser Cys Gly Gln Arg Leu Arg Lys Ser Leu Ser Ser Met Thr
355 360 365

Arg Val Val Gly Gly Leu Val Ala Leu Arg Gly Ala His Pro Tyr Ile
370 375 380

Ala Ala Leu Tyr Trp Gly His Ser Phe Cys Ala Gly Ser Leu Ile Ala
385 390 395 400

Pro Cys Trp Val Leu Thr Ala Ala His Cys Leu Gin Asp Arg Pro Ala 
405 410 415 

Pro Glu Asp Leu Thr Val Val Leu Gly Gin Glu Arg Arg Asn His Ser 
420 425 430 

Cys Glu Pro Cys Gin Thr Leu Ala Val Arg Ser Tyr Arg Leu His Glu 
435 440 445 

Ala Phe Ser Pro Val Ser Tyr Gin His Asp Leu Ala Leu Leu Arg Leu 
450 455 460 

Gin Glu Asp Ala Asp Gly Ser Cys Ala Leu Leu Ser Pro Tyr Val Gin 
465 470 475 480 

Pro Val Cys Leu Pro Ser Gly Ala Ala Arg Pro Ser Glu Thr Thr Leu 
485 490 495 

Cys Gin Val Ala Gly Trp Gly His Gin Phe Glu Gly Ala Glu Glu Tyr 
500 505 510 

Ala Ser Phe Leu Gin Glu Ala Gin Val Pro Phe Leu Ser Leu Glu Arg 
515 520 525 

Cys Ser Ala Pro Asp Val His Gly Ser Ser lie Leu Pro Gly Met Leu 
530 535 540 

Cys Ala Gly Phe Leu Glu Gly Gly Thr Asp Ala Cys Ala Gly Glu Leu 
545 550 555 560 

Leu Ala Gly Trp Arg Pro Ser Pro Arg Pro Ser Ala Xaa Ser Gin Val 
565 570 575 

His Ser Ala Asp Cys Val Phe Pro Thr Gin Gly Asp Ser Gly Gly Pro 
580 585 590 

Leu Val Cys Glu Asp Gln Ala Ala Glu Arg Arg Leu Thr Leu Gln Gly
595 600 605

Ile Ile Ser Trp Gly Ser Gly Cys Gly Asp Arg Asn Lys Pro Gly Val
610 615 620

Tyr Thr Asp Val Ala Tyr Tyr Leu Ala Trp Ile Arg Glu His Thr Val
625 630 635 640

Ser 



<210> 21
<211> 164
<212> PRT
<213> HUMAN



<400> 21 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Ala Gly Lys Leu
100 105 110

Gln Asp Gly Leu Leu His Ile Thr Thr Cys Ser Phe Val Ala Pro Trp
115 120 125

Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly Phe Thr Lys Thr Tyr Thr
130 135 140

Val Gly Cys Glu Glu Cys Thr Val Phe Pro Cys Ser His Ser His Leu
145 150 155 160

Ser Ser Gly Gln



<210> 22 

<211> 123 

<212> PRT 

<213> HUMAN 

<400> 22 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Ala Gly Lys Leu
100 105 110

Gln Val Val Met Cys Lys Ser Pro Ser Val Val
115 120



<210> 23 
<211> 211 
<212> PRT 
<213> HUMAN 

<400> 23 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Leu Leu Gly Lys
100 105 110

Leu Gln Asp Gly Ile Phe Ala His Ser Leu Thr Cys Ser Phe Cys Trp
115 120 125

Val Pro Trp Glu Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly Phe Thr
130 135 140

Lys Thr Tyr Thr Val Gly Cys Glu Glu Cys Thr Val Phe Pro Cys Leu
145 150 155 160

Ser Ile Pro Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp Thr Asp
165 170 175

Gln Leu Leu Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His Leu Ala
180 185 190

Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu Arg Ser
195 200 205

Gln Ile Ala
210



<210> 24
<211> 160
<212> PRT
<213> HUMAN

<400> 24

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ser His Asn Arg Ser Glu Glu Phe Leu Ile Leu Ser Ile Pro
100 105 110

Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp Thr Asp Gln Leu Leu
115 120 125

Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His Leu Ala Cys Leu Pro
130 135 140

Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu Arg Ser Gln Ile Ala
145 150 155 160



<210> 25 

<211> 197 

<212> PRT 

<213> HUMAN

<400> 25 

Met Ala Pro Phe Glu Pro Leu Ala Ser Gly Ile Leu Leu Leu Leu Trp
1 5 10 15

Leu Ile Ala Pro Ser Arg Ala Cys Thr Cys Val Pro Pro His Pro Gln
20 25 30

Thr Ala Phe Cys Asn Ser Asp Leu Val Ile Arg Ala Lys Phe Val Gly
35 40 45

Thr Pro Glu Val Asn Gln Thr Thr Leu Tyr Gln Arg Tyr Glu Ile Lys
50 55 60

Met Thr Lys Met Tyr Lys Gly Phe Gln Ala Leu Gly Asp Ala Ala Asp
65 70 75 80

Ile Arg Phe Val Tyr Thr Pro Ala Met Glu Ser Val Cys Gly Tyr Phe
85 90 95

His Arg Ala Gly Lys Leu Gln Asp Gly Leu Leu His Ile Thr Thr Cys
100 105 110

Ser Phe Val Ala Pro Trp Asn Ser Leu Ser Leu Ala Gln Arg Arg Gly
115 120 125

Phe Thr Lys Thr Tyr Thr Val Gly Cys Glu Glu Cys Thr Val Phe Pro
130 135 140

Cys Leu Ser Ile Pro Cys Lys Leu Gln Ser Gly Thr His Cys Leu Trp
145 150 155 160

Thr Asp Gln Leu Leu Gln Gly Ser Glu Lys Gly Phe Gln Ser Arg His
165 170 175

Leu Ala Cys Leu Pro Arg Glu Pro Gly Leu Cys Thr Trp Gln Ser Leu
180 185 190

Arg Ser Gln Ile Ala
195



<210> 26 
<211> 494 
<212> PRT 
<213> HUMAN 

<400> 26 

Met Arg Ala Leu Leu Ala Arg Leu Leu Leu Cys Val Leu Val Val Ser
1 5 10 15

Asp Ser Lys Gly Ser Asn Glu Leu His Gin Val Pro Ser Asn Cys Asp 
20 25 30 

Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser Asn lie 
35 40 45 

His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gin His Cys Glu lie 
50 55 60 

Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr Arg Gly 
65 70 75 80 

Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp Asn Ser 
85 90 95 

Ala Thr Val Leu Gin Gin Thr Tyr His Ala His Arg Ser Asp Ala Leu 
100 105 110 

Gin Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Glu Val Gly Ala Gin 
115 120 125 

Gly Pro Lys Ala Leu Pro Thr Val Pro Arg Asn Leu Val Thr Ile Pro
130 135 140

Phe Ser Gln Arg Ala Gly His Ser Thr Arg Glu Val Gln Pro Leu Val
145 150 155 160

Glu Ser Ser Leu Arg Gly Gly Gly Arg Glu Gly Pro Leu Gly Trp Asn
165 170 175



Asp lie Pro Tyr Leu Ser Val Leu Pro Gly Asn Pro Asp Asn Arg Arg 
180 185 190 

Arg Pro Trp Cys Tyr Val Gin Val Gly Leu Lys Pro Leu Val Gin Glu 
195 200 205 

Cys Met Val His Asp Cys Ala Asp Gly Lys Lys Pro Ser Ser Pro Pro 
210 215 220 

Glu Glu Leu Lys Phe Gin Cys Gly Gin Lys Thr Leu Arg Pro Arg Phe 
225 230 235 240 

Lys lie lie Gly Gly Glu Phe Thr Thr lie Glu Asn Gin Pro Trp Phe 
245 250 255 

Ala Ala Ile Tyr Arg Arg His Arg Gly Gly Ser Val Thr Tyr Val Cys
260 265 270

Gly Gly Ser Leu lie Ser Pro Cys Trp Val lie Ser Ala Thr His Cys 
275 280 285 

Phe Ile Asp Tyr Pro Lys Lys Glu Asp Tyr Ile Val Tyr Leu Gly Arg
290 295 300

Ser Arg Leu Asn Ser Asn Thr Gln Gly Glu Met Lys Phe Glu Val Glu
305 310 315 320 

Asn Leu lie Leu His Lys Asp Tyr Ser Ala Asp Thr Leu Ala His His 
325 330 335 

Asn Asp lie Ala Leu Leu Lys lie Arg Ser Lys Glu Gly Arg Cys Ala 
340 345 350 

Gin Pro Ser Arg Thr lie Gin Thr lie Cys Leu Pro Ser Met Tyr Asn 
355 360 365 

Asp Pro Gin Phe Gly Thr Ser Cys Glu lie Thr Gly Phe Gly Lys Glu 
370 375 380 

Asn Ser Thr Asp Tyr Leu Tyr Pro Glu Gin Leu Lys Met Thr Val Val 
385 390 395 400 

Lys Leu lie Ser His Arg Glu Cys Gin Gin Pro His Tyr Tyr Gly Ser 
405 410 415 

Glu Val Thr Thr Lys Met Leu Cys Ala Ala Asp Pro Gin Trp Lys Thr 
420 425 430 

Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Ser Leu Gin 
435 440 445 

Gly Arg Met Thr Leu Thr Gly Ile Val Ser Trp Gly Arg Gly Cys Ala
450 455 460

Leu Lys Asp Lys Pro Gly Val Tyr Thr Arg Val Ser His Phe Leu Pro
465 470 475 480

Trp Ile Arg Ser His Thr Lys Glu Glu Asn Gly Leu Ala Leu
485 490



<210> 27 
<211> 285 
<212> PRT 
<213> HUMAN 

<400> 27 

Met Gln Met Ser Pro Ala Leu Thr Cys Leu Val Leu Gly Leu Ala Leu
1 5 10 15

Val Phe Gly Glu Gly Ser Ala Val His His Pro Pro Ser Tyr Val Ala 
20 25 30 

His Leu Ala Ser Asp Phe Gly Val Arg Val Phe Gin Gin Val Ala Gin 
35 40 45 

Ala Ser Lys Asp Arg Asn Val Val Phe Ser Pro Tyr Gly Val Ala Ser 
50 55 60 

Val Leu Ala Met Leu Gln Leu Thr Thr Gly Gly Glu Thr Gln Gln Gln
65 70 75 80

Ile Gln Ala Ala Met Gly Phe Lys Ile Asp Asp Lys Gly Met Ala Pro
85 90 95 

Ala Leu Arg His Leu Tyr Lys Glu Leu Met Gly Pro Trp Asn Lys Asp 
100 105 110 

Glu He Ser Thr Thr Asp Ala He Phe Val Gin Arg Asp Leu Lys Leu 
115 120 125 

Val Gln Gly Phe Met Pro His Phe Phe Arg Leu Phe Arg Ser Thr Val
130 135 140

Lys Gin Val Asp Phe Ser Glu Val Glu Arg Ala Arg Phe He He Asn 
145 150 155 160 

Asp Trp Val Lys Thr His Thr Lys Gly Met He Ser Asn Leu Leu Gly 
165 170 175 

Lys Gly Ala Val Asp Gin Leu Thr Arg Leu Val Leu Val Asn Ala Leu 
180 185 190 

Tyr Phe Asn Gly Gin Trp Lys Thr Pro Phe Pro Asp Ser Ser Thr His 
195 200 205 

Arg Arg Leu Phe His Lys Ser Asp Gly Ser Thr Val Ser Val Pro Met 
210 215 220 

Met Ala Gln Thr Asn Lys Phe Asn Tyr Thr Glu Phe Thr Thr Pro Asp
225 230 235 240

Gly His Tyr Tyr Asp Ile Leu Glu Leu Pro Tyr His Gly Asp Thr Leu
245 250 255

Ser Met Phe Ile Ala Ala Asp Leu Val Pro Thr Glu Ala Leu Cys Arg
260 265 270

Met Glu Leu Arg Gly Leu Gln Glu Leu Leu Cys Ala Trp
275 280 285



<210> 28 
<211> 399 
<212> PRT 
<213> HUMAN 

<400> 28 

Met Gln Met Ser Pro Ala Leu Thr Cys Leu Val Leu Gly Leu Ala Leu
1 5 10 15

Val Phe Gly Glu Gly Ser Ala Val His His Pro Pro Ser Tyr Val Ala 
20 25 30 

His Leu Ala Ser Asp Phe Gly Val Arg Val Phe Gin Gin Val Ala Gin 
35 40 45 

Ala Ser Lys Asp Arg Asn Val Val Phe Ser Pro Tyr Gly Val Ala Ser
50 55 60

Val Leu Ala Met Leu Gin Leu Thr Thr Gly Gly Glu Thr Gin Gin Gin 
65 70 75 80 

Ile Gln Ala Ala Met Gly Phe Lys Ile Asp Asp Lys Gly Met Ala Pro
85 90 95 

Ala Leu Arg His Leu Tyr Lys Glu Leu Met Gly Pro Trp Asn Lys Asp 
100 105 110 

Glu lie Ser Thr Thr Asp Ala lie Phe Val Gin Arg Asp Leu Lys Leu 
115 120 125 

Val Gln Gly Phe Met Pro His Phe Phe Arg Leu Phe Arg Ser Thr Val
130 135 140 

Lys Gin Val Asp Phe Ser Glu Val Glu Arg Ala Arg Phe lie lie Asn 
145 150 155 160 

Asp Trp Val Lys Thr His Thr Lys Gly Met lie Ser Asn Leu Leu Gly 
165 170 175 

Lys Gly Ala Val Asp Gin Leu Thr Arg Leu Val Leu Val Asn Ala Leu 
180 185 190 

Tyr Phe Asn Gly Gin Trp Lys Thr Pro Phe Pro Asp Ser Ser Thr His 
195 200 205 

Arg Arg Leu Phe His Lys Ser Asp Gly Ser Thr Val Ser Val Pro Met 
210 215 220 

Met Ala Gin Thr Asn Lys Phe Asn Tyr Thr Glu Phe Thr Thr Pro Asp 
225 230 235 240 

Gly His Tyr Tyr Asp lie Leu Glu Leu Pro Tyr His Gly Asp Thr Leu 
245 250 255 



Ser Met Phe Ile Ala Ala Pro Tyr Glu Lys Glu Val Pro Leu Ser Ala
260 265 270

Leu Thr Asn Ile Leu Ser Ala Gln Leu Ile Ser His Trp Lys Gly Asn
275 280 285

Met Thr Arg Leu Pro Arg Leu Leu Val Leu Pro Lys Phe Ser Leu Glu 
290 295 300 

Thr Glu Val Asp Leu Arg Lys Pro Leu Glu Asn Leu Gly Met Thr Asp 
305 310 315 320 

Met Phe Arg Gin Phe Gin Ala Asp Phe Thr Ser Leu Ser Asp Gin Glu 
325 330 335 

Pro Leu His Val Ala Gin Ala Leu Gin Lys Val Lys lie Glu Val Asn 
340 345 350 

Glu Ser Gly Thr Val Ala Ser Ser Ser Thr Ala Val lie Val Ser Ala 
355 360 365 

Arg Met Ala Pro Glu Glu lie lie Met Asp Arg Pro Phe Leu Phe Val 
370 375 380 

Val Pro Pro Gin Lys Gin Cys Ala Trp Val lie Leu Glu Cys Arg 
385 390 395 



<210> 29 

<211> 317 

<212> PRT 

<213> HUMAN 

<400> 29 

Met Thr Ala Ala Ser Met Gly Pro Val Arg Val Ala Phe Val Val Leu
1 5 10 15

Leu Ala Leu Cys Ser Arg Pro Ala Val Gly Gin Asn Cys Ser Gly Pro 
20 25 30 

Cys Arg Cys Pro Asp Glu Pro Ala Pro Arg Cys Pro Ala Gly Val Ser 
35 40 45 

Leu Val Leu Asp Gly Cys Gly Cys Cys Arg Val Cys Ala Lys Gln Leu
50 55 60

Gly Glu Leu Cys Thr Glu Arg Asp Pro Cys Asp Pro His Lys Gly Leu 
65 70 75 80 

Phe Cys Asp Phe Gly Ser Pro Ala Asn Arg Lys lie Gly Val Cys Thr 
85 90 95 

Ala Lys Asp Gly Ala Pro Cys lie Phe Gly Gly Thr Val Tyr Arg Ser 
100 105 110 

Gly Glu Ser Phe Gln Ser Ser Cys Lys Tyr Gln Cys Thr Cys Leu Asp
115 120 125

Gly Ala Val Gly Cys Met Pro Leu Cys Ser Met Asp Val Arg Leu Pro
130 135 140

Ser Pro Asp Cys Pro Leu Pro Leu Glu Asp Thr Phe Gly Pro Asp Pro
145 150 155 160



Thr Met lie Arg Ala Asn Cys Leu Val Gin Thr Thr Glu Trp Ser Ala 
165 170 175 

Cys Ser Lys Thr Cys Gly Met Gly lie Ser Thr Arg Val Thr Asn Asp 
180 185 190 

Asn Ala Ser Cys Arg Leu Glu Lys Gin Ser Arg Leu Cys Met Val Arg 
195 200 205 

Pro Cys Glu Ser Asp Leu Glu Glu Asn lie Lys Lys Gly Lys Lys Cys 
210 215 220 

lie Arg Thr Pro Lys lie Ser Lys Pro lie Lys Phe Glu Leu Ser Gly 
225 230 235 240 

Cys Thr Ser Met Lys Thr Tyr Arg Ala Lys Phe Cys Gly Val Cys Thr 
245 250 255 

Asp Gly Arg Cys Cys Thr Pro His Arg Thr Thr Thr Leu Pro Val Glu 
260 265 270 

Phe Lys Cys Pro Asp Gly Glu Val Met Lys Lys Asn Met Met Phe lie 
275 280 285 

Lys Thr Cys Ala Cys His Tyr Asn Cys Pro Gly Asp Asn Asp lie Phe 
290 295 300 

Glu Ser Leu Tyr Tyr Arg Lys Met Tyr Gly Asp Met Ala 
305 310 315 



<210> 30 

<211> 342 

<212> PRT 

<213> HUMAN 

<400> 30 
Asn Met Glu Asn Ser Leu Arg Cys Val Trp Val Pro Lys Leu Ala Phe
1 5 10 15

Val Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gln Val Thr Gly
20 25 30

Phe Gln Ile Lys Ala Phe Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser
35 40 45

Asp Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala
50 55 60

Glu Ser Asp Arg Gly Val Pro Val Ile Lys Trp Lys Lys Asp Ala Ile
65 70 75 80

His Leu Ala Leu Gly Met Asp Glu Arg Lys Gln Gln Leu Ser Asn Gly
85 90 95

Ser Leu Leu Ile Gln Asn Ile Leu His Ser Arg His His Lys Pro Asp
100 105 110

Glu Gly Leu Tyr Gln Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser Ile
115 120 125

Ile Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu
130 135 140

Ser Gln Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu
145 150 155 160

Lys Cys Glu Val Ile Gly Glu Pro Met Pro Thr Ile His Trp Gln Lys
165 170 175

Asn Gln Gln Asp Leu Thr Pro Ile Pro Gly Asp Ser Arg Val Val Val
180 185 190

Leu Pro Ser Gly Ala Leu Gln Ile Ser Arg Leu Gln Pro Gly Asp Ile
195 200 205

Gly Ile Tyr Arg Cys Ser Ala Arg Asn Pro Ala Ser Ser Arg Thr Gly
210 215 220

Asn Glu Ala Glu Val Arg Ile Leu Ser Asp Pro Gly Leu His Arg Gln
225 230 235 240

Leu Tyr Phe Leu Gln Arg Pro Ser Asn Val Val Ala Ile Glu Gly Lys
245 250 255

Asp Ala Val Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe
260 265 270

Thr Trp Leu Arg Gly Glu Glu Val Ile Gln Leu Arg Ser Lys Lys Tyr
275 280 285

Ser Leu Leu Gly Gly Ser Asn Leu Leu Ile Ser Asn Val Thr Asp Asp
290 295 300

Asp Ser Gly Met Tyr Thr Cys Val Val Thr Tyr Lys Asn Glu Asn Ile
305 310 315 320

Ser Ala Ser Ala Glu Leu Thr Val Leu Val Ile Ile Asp Lys Val Leu
325 330 335

Val Asp Thr Phe Trp Val
340



<210> 31 

<211> 1433 

<212> PRT 

<213> HUMAN 

<400> 31 

Asn Met Glu Asn Ser Leu Arg Cys Val Trp Val Pro Lys Leu Ala Phe
1 5 10 15

Val Leu Phe Gly Ala Ser Leu Leu Ser Ala His Leu Gln Val Thr Gly
20 25 30

Phe Gln Ile Lys Ala Phe Thr Ala Leu Arg Phe Leu Ser Glu Pro Ser
35 40 45



Asp Ala Val Thr Met Arg Gly Gly Asn Val Leu Leu Asp Cys Ser Ala 
50 55 60 

Glu Ser Asp Arg Gly Val Pro Val lie Lys Trp Lys Lys Asp Ala lie 
65 70 75 80 

His Leu Ala Leu Gly Met Asp Glu Arg Lys Gin Gin Leu Ser Asn Gly 
85 90 95 

Ser Leu Leu lie Gin Asn lie Leu His Ser Arg His His Lys Pro Asp 
100 105 110 

Glu Gly Leu Tyr Gln Cys Glu Ala Ser Leu Gly Asp Ser Gly Ser Ile
115 120 125

lie Ser Arg Thr Ala Lys Val Ala Val Ala Gly Pro Leu Arg Phe Leu 
130 135 140 

Ser Gin Thr Glu Ser Val Thr Ala Phe Met Gly Asp Thr Val Leu Leu 
145 150 155 160 

Lys Cys Glu Val He Gly Glu Pro Met Pro Thr He His Trp Gin Lys 
165 170 175 

Asn Gin Gin Asp Leu Thr Pro He Pro Gly Asp Ser Arg Val Val Val 
180 185 190 

Leu Pro Ser Gly Ala Leu Gin lie Ser Arg Leu Gin Pro Gly Asp He 
195 200 205 

Gly He Tyr Arg Cys Ser Ala Arg Asn Pro Ala Ser Ser Arg Thr Gly 
210 215 220 

Asn Glu Ala Glu Val Arg He Leu Ser Asp Pro Gly Leu His Arg Gin 
225 230 235 240 

Leu Tyr Phe Leu Gin Arg Pro Ser Asn Val Val Ala He Glu Gly Lys 
245 250 255 

Asp Ala Val Leu Glu Cys Cys Val Ser Gly Tyr Pro Pro Pro Ser Phe 
260 265 270 

Thr Trp Leu Arg Gly Glu Glu Val He Gin Leu Arg Ser Lys Lys Tyr 
275 280 285 

Ser Leu Leu Gly Gly Ser Asn Leu Leu He Ser Asn Val Thr Asp Asp 
290 295 300 

Asp Ser Gly Met Tyr Thr Cys Val Val Thr Tyr Lys Asn Glu Asn He 
305 310 315 320 

Ser Ala Ser Ala Glu Leu Thr Val Leu Val Pro Pro Trp Phe Leu Asn 
325 330 335 

His Pro Ser Asn Leu Tyr Ala Tyr Glu Ser Met Asp Ile Glu Phe Glu
340 345 350 

Cys Thr Val Ser Gly Lys Pro Val Pro Thr Val Asn Trp Met Lys Asn
355 360 365

Gly Asp Val Val Ile Pro Ser Asp Tyr Phe Gln Ile Val Gly Gly Ser
370 375 380



Asn Leu Arg lie Leu Gly Val Val Lys Ser Asp Glu Gly Phe Tyr Gin 
385 390 395 400 

Cys Val Ala Glu Asn Glu Ala Gly Asn Ala Gin Thr Ser Ala Gin Leu 
405 410 415 

lie Val Pro Lys Pro Ala He Pro Ser Ser Ser Val Leu Pro Ser Ala 
420 425 430 

Pro Arg Asp Val Val Pro Val Leu Val Ser Ser Arg Phe Val Arg Leu 
435 440 445 

Ser Trp Arg Pro Pro Ala Glu Ala Lys Gly Asn He Gin Thr Phe Thr 
450 455 460 

Val Phe Phe Ser Arg Glu Gly Asp Asn Arg Glu Arg Ala Leu Asn Thr 
465 470 475 480 

Thr Gln Pro Gly Ser Leu Gln Leu Thr Val Gly Asn Leu Lys Pro Glu
485 490 495 

Ala Met Tyr Thr Phe Arg Val Val Ala Tyr Asn Glu Trp Gly Pro Gly 
500 505 510 

Glu Ser Ser Gin Pro He Lys Val Ala Thr Gin Pro Glu Leu Gin Val 
515 520 525 

Pro Gly Pro Val Glu Asn Leu Gin Ala Val Ser Thr Ser Pro Thr Ser 
530 535 540 

He Leu He Thr Trp Glu Pro Pro Ala Tyr Ala Asn Gly Pro Val Gin 
545 550 555 560 

Gly Tyr Arg Leu Phe Cys Thr Glu Val Ser Thr Gly Lys Glu Gln Asn
565 570 575 

He Glu Val Asp Gly Leu Ser Tyr Lys Leu Glu Gly Leu Lys Lys Phe 
580 585 590 

Thr Glu Tyr Ser Leu Arg Phe Leu Ala Tyr Asn Arg Tyr Gly Pro Gly 
595 600 605 

Val Ser Thr Asp Asp He Thr Val Val Thr Leu Ser Asp Val Pro Ser 
610 615 620 

Ala Pro Pro Gin Asn Val Ser Leu Glu Val Val Asn Ser Arg Ser He 
625 630 635 640 

Lys Val Ser Trp Leu Pro Pro Pro Ser Gly Thr Gin Asn Gly Phe He 
645 650 655 

Thr Gly Tyr Lys Ile Arg His Arg Lys Thr Thr Arg Arg Gly Glu Met
660 665 670

Glu Thr Leu Glu Pro Asn Asn Leu Trp Tyr Leu Phe Thr Gly Leu Glu
675 680 685

Lys Gly Ser Gln Tyr Ser Phe Gln Val Ser Ala Met Thr Val Asn Gly
690 695 700



Thr Gly Pro Pro Ser Asn Trp Tyr Thr Ala Glu Thr Pro Glu Asn Asp 
705 710 715 720 

Leu Asp Glu Ser Gin Val Pro Asp Gin Pro Ser Ser Leu His Val Arg 
725 730 735 

Pro Gin Thr Asn Cys lie lie Met Ser Trp Thr Pro Pro Leu Asn Pro 
740 745 750 

Asn lie Val Val Arg Gly Tyr lie lie Gly Tyr Gly Val Gly Ser Pro 
755 760 765 

Tyr Ala Glu Thr Val Arg Val Asp Ser Lys Gin Arg Tyr Tyr Ser lie 
770 775 780 

Glu Arg Leu Glu Ser Ser Ser His Tyr Val lie Ser Leu Lys Ala Phe 
785 790 795 800 

Asn Asn Ala Gly Glu Gly Val Pro Leu Tyr Glu Ser Ala Thr Thr Arg 
805 810 815 

Ser lie Thr Asp Pro Thr Asp Pro Val Asp Tyr Tyr Pro Leu Leu Asp 
820 825 830 

Asp Phe Pro Thr Ser Val Pro Asp Leu Ser Thr Pro Met Leu Pro Pro 
835 840 845 

Val Gly Val Gin Ala Val Ala Leu Thr His Asp Ala Val Arg Val Ser 
850 855 860 

Trp Ala Asp Asn Ser Val Pro Lys Asn Gin Lys Thr Ser Glu Val Arg 
865 870 875 880 

Leu Tyr Thr Val Arg Trp Arg Thr Ser Phe Ser Ala Ser Ala Lys Tyr 
885 890 895 

Lys Ser Glu Asp Thr Thr Ser Leu Ser Tyr Thr Ala Thr Gly Leu Lys 
900 905 910 

Pro Asn Thr Met Tyr Glu Phe Ser Val Met Val Thr Lys Asn Arg Arg 
915 920 925 

Ser Ser Thr Trp Ser Met Thr Ala His Ala Thr Thr Tyr Glu Ala Ala 
930 935 940 

Pro Thr Ser Ala Pro Lys Asp Phe Thr Val Ile Thr Arg Glu Gly Lys
945 950 955 960 

Pro Arg Ala Val lie Val Ser Trp Gin Pro Pro Leu Glu Ala Asn Gly 
965 970 975 

Lys Ile Thr Ala Tyr Ile Leu Phe Tyr Thr Leu Asp Lys Asn Ile Pro
980 985 990

Ile Asp Asp Trp Ile Met Glu Thr Ile Ser Gly Asp Arg Leu Thr His
995 1000 1005

Gln Ile Met Asp Leu Asn Leu Asp Thr Met Tyr Tyr Phe Arg Ile Gln
1010 1015 1020

Ala Arg Asn Ser Lys Gly Val Gly Pro Leu Ser Asp Pro Ile Leu Phe
1025 1030 1035 1040

Arg Thr Leu Lys Val Glu His Pro Asp Lys Met Ala Asn Asp Gin Gly 
1045 1050 1055 

Arg His Gly Asp Gly Gly Tyr Trp Pro Val Asp Thr Asn Leu lie Asp 
1060 1065 1070 

Arg Ser Thr Leu Asn Glu Pro Pro lie Gly Gin Met His Pro Pro His 
1075 1080 1085 

Gly Ser Val Thr Pro Gin Lys Asn Ser Asn Leu Leu Val lie lie Val 
1090 1095 1100 

Val Thr Val Gly Val lie Thr Val Leu Val Val Val He Val Ala Val 
1105 1110 1115 1120 

He Cys Thr Arg Arg Ser Ser Ala Gin Gin Arg Lys Lys Arg Ala Thr 
1125 1130 1135 

His Ser Ala Gly Lys Arg Lys Gly Ser Gin Lys Asp Leu Arg Pro Pro 
1140 1145 1150 

Asp Leu Trp He His His Glu Glu Met Glu Met Lys Asn He Glu Lys 
1155 1160 1165 

Pro Ser Gly Thr Asp Pro Ala Gly Arg Asp Ser Pro He Gin Ser Cys 
1170 1175 1180 

Gln Asp Leu Thr Pro Val Ser His Ser Gln Ser Glu Thr Gln Leu Gly
1185 1190 1195 1200

Ser Lys Ser Thr Ser His Ser Gly Gin Asp Thr Glu Glu Ala Gly Ser 
1205 1210 1215 

Ser Met Ser Thr Leu Glu Arg Ser Leu Ala Ala Arg Arg Ala Pro Arg 
1220 1225 1230 

Ala Lys Leu Met He Pro Met Asp Ala Gin Ser Asn Asn Pro Ala Val 
1235 1240 1245 

Val Ser Ala He Pro Val Pro Thr Leu Glu Ser Ala Gin Tyr Pro Gly 
1250 1255 1260 

He Leu Pro Ser Pro Thr Cys Gly Tyr Pro His Pro Gin Phe Thr Leu 
1265 1270 1275 1280 

Arg Pro Val Pro Phe Pro Thr Leu Ser Val Asp Arg Gly Phe Gly Ala 
1285 1290 1295

Gly Arg Ser Gin Ser Val Ser Glu Gly Pro Thr Thr Gin Gin Pro Pro 
1300 1305 1310 

Met Leu Pro Pro Ser Gln Pro Glu His Ser Ser Ser Glu Glu Ala Pro
1315 1320 1325

Ser Arg Thr Ile Pro Thr Ala Cys Val Arg Pro Thr His Pro Leu Arg
1330 1335 1340

Ser Phe Ala Asn Pro Leu Leu Pro Pro Pro Met Ser Ala Ile Glu Pro
1345 1350 1355 1360

Lys Val Pro Tyr Thr Pro Leu Leu Ser Gin Pro Gly Pro Thr Leu Pro 
1365 1370 1375 

Lys Thr His Val Lys Thr Ala Ser Leu Gly Leu Ala Gly Lys Ala Arg 
1380 1385 1390 

Ser Pro Leu Leu Pro Val Ser Val Pro Thr Ala Pro Glu Val Ser Glu 
1395 1400 1405 

Glu Ser His Lys Pro Thr Glu Asp Ser Ala Asn Val Ser Ala Ser Leu 
1410 1415 1420 

Lys Phe Met Leu His Gin Gly Thr Asp 
1425 1430 



<210> 32 

<211> 865 

<212> PRT 

<213> HUMAN 

<400> 32 

Met Pro Gly Lys Arg Gly Leu Gly Trp Trp Trp Ala Arg Leu Pro Leu
1 5 10 15

Cys Leu Leu Leu Ser Leu Tyr Gly Pro Trp Met Pro Ser Ser Leu Gly 
20 25 30 

Lys Pro Lys Gly His Pro His Met Asn Ser lie Arg lie Asp Gly Asp 
35 40 45 

lie Thr Leu Gly Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly 
50 55 60 

Lys Pro Cys Gly Glu Leu Lys Lys Glu Lys Gly lie His Arg Leu Glu 
65 70 75 80 

Ala Met Leu Phe Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu
85 90 95 

Pro Asn lie Thr Leu Gly Ala Arg lie Leu Asp Thr Cys Ser Arg Asp 
100 105 110 

Thr His Ala Leu Glu Gin Ser Leu Thr Phe Val Gin Ala Leu lie Glu 
115 120 125 

Lys Asp Gly Thr Glu Val Arg Cys Gly Ser Gly Gly Pro Pro lie lie 
130 135 140 

Thr Lys Pro Glu Arg Val Val Gly Val lie Gly Ala Ser Gly Ser Ser 
145 150 155 160 

Val Ser lie Met Val Ala Asn lie Leu Arg Leu Phe Lys lie Pro Gin 
165 170 175 

Ile Ser Tyr Ala Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr
180 185 190

Asp Phe Phe Ser Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala
195 200 205

Met Val Asp Ile Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Val
210 215 220

Ala Ser Glu Gly Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln
225 230 235 240

Lys Ser Arg Glu Asp Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile
245 250 255

Pro Arg Glu Pro Lys Ala Gly Glu Phe Asp Lys Ile Ile Arg Arg Leu
260 265 270

Leu Glu Thr Ser Asn Ala Arg Ala Val Ile Ile Phe Ala Asn Glu Asp
275 280 285

Asp Ile Arg Arg Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly
290 295 300

His Phe Phe Trp Met Gly Ser Asp Ser Trp Gly Ser Lys Ile Ala Pro
305 310 315 320

Val Leu His Leu Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro
325 330 335

Lys Arg Met Ser Val Arg Asp Arg Glu Arg Ile Gly Gln Asp Ser Ala
340 345 350

Tyr Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala
355 360 365

Met Gly His Ala Leu His Ala Met His Arg Asp Leu Cys Pro Gly Arg
370 375 380

Val Gly Leu Cys Pro Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu
385 390 395 400

Lys Tyr Ile Arg Asn Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val
405 410 415

Thr Phe Asn Glu Asn Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln
420 425 430

Tyr Gln Leu Arg Asn Asp Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp
435 440 445

Thr Asp His Leu His Leu Arg Ile Glu Arg Met His Trp Pro Gly Ser
450 455 460

Gly Gln Gln Leu Pro Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly
465 470 475 480

Glu Arg Lys Lys Thr Val Lys Gly Met Pro Cys Cys Trp His Cys Glu
485 490 495

Pro Cys Thr Gly Tyr Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr
500 505 510

Cys Pro Tyr Asp Met Arg Pro Thr Glu Asn Arg Thr Gly Cys Arg Pro
515 520 525



Ile Pro Ile Ile Lys Leu Glu Trp Gly Ser Pro Trp Ala Val Leu Pro
530 535 540

Leu Phe Leu Ala Val Val Gly Ile Ala Ala Thr Leu Phe Val Val Ile
545 550 555 560

Thr Phe Val Arg Tyr Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg
565 570 575

Glu Leu Ser Tyr Val Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr
580 585 590

Thr Phe Leu Met Ile Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg
595 600 605

Arg Ile Phe Leu Gly Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu
610 615 620

Thr Lys Thr Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser
625 630 635 640

Val Ser Ala Pro Arg Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr
645 650 655

Phe Ser Leu Ile Ser Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val
660 665 670

Val Asp Pro Ser His Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu
675 680 685

Asp Pro Arg Phe Ala Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu
690 695 700

Ser Leu Ile Cys Leu Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys
705 710 715 720

Thr Val Tyr Ala Ile Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu
725 730 735

Ala Lys Pro Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu
740 745 750

Ala Phe Ile Pro Ile Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu
755 760 765

Tyr Ile Gln Thr Thr Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser
770 775 780

Val Ser Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe
785 790 795 800

His Pro Glu Gln Asn Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val
805 810 815

Val Thr Ala Ala Thr Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe
820 825 830



Arg Pro Asn Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Ala
835 840 845

Pro Ala Leu Ala Thr Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala
850 855 860

Ile
865

<210> 33
<211> 845
<212> PRT
<213> HUMAN

<400> 33

Met Glu Thr Lys Gly Tyr His Ser Leu Pro Glu Gly Leu Asp Met Glu
1 5 10 15

Arg Arg Trp Gly Gln Val Ser Gln Ala Val Glu Arg Ser Ser Leu Gly
20 25 30

Pro Thr Glu Arg Thr Asp Glu Asn Asn Tyr Met Glu Ile Val Asn Val
35 40 45

Ser Cys Val Ser Gly Ala Ile Pro Asn Asn Ser Thr Gln Gly Ser Ser
50 55 60

Lys Glu Lys Gln Glu Leu Leu Pro Cys Leu Gln Gln Asp Asn Asn Arg
65 70 75 80

Pro Gly Ile Leu Thr Ser Asp Ile Lys Thr Glu Leu Glu Ser Lys Glu
85 90 95

Leu Ser Ala Thr Val Ala Glu Ser Met Gly Leu Tyr Met Asp Ser Val
100 105 110

Arg Asp Ala Asp Tyr Ser Tyr Glu Gln Gln Asn Gln Gln Gly Ser Met
115 120 125

Ser Pro Ala Lys Ile Tyr Gln Asn Val Glu Gln Leu Val Lys Phe Tyr
130 135 140

Lys Gly Asn Gly His Arg Pro Ser Thr Leu Ser Cys Val Asn Thr Pro
145 150 155 160

Leu Arg Ser Phe Met Ser Asp Ser Gly Ser Ser Val Asn Gly Gly Val
165 170 175

Met Arg Ala Ile Val Lys Ser Pro Ile Met Cys His Glu Lys Ser Pro
180 185 190

Ser Val Cys Ser Pro Leu Asn Met Thr Ser Ser Val Cys Ser Pro Ala
195 200 205

Gly Ile Asn Ser Val Ser Ser Thr Thr Ala Ser Phe Gly Ser Phe Pro
210 215 220

Val His Ser Pro Ile Thr Gln Gly Thr Pro Leu Thr Cys Ser Pro Asn
225 230 235 240

Ala Glu Asn Arg Gly Ser Arg Ser His Ser Pro Ala His Ala Ser Asn
245 250 255



Val Gly Ser Pro Leu Ser Ser Pro Leu Ser Ser Met Lys Ser Ser Ile
260 265 270

Ser Ser Pro Pro Ser His Cys Ser Val Lys Ser Pro Val Ser Ser Pro
275 280 285

Asn Asn Val Thr Leu Arg Ser Ser Val Ser Ser Pro Ala Asn Ile Asn
290 295 300

Asn Ser Arg Cys Ser Val Ser Ser Pro Ser Asn Thr Asn Asn Arg Ser
305 310 315 320

Thr Leu Ser Ser Pro Ala Ala Ser Thr Val Gly Ser Ile Cys Ser Pro
325 330 335

Val Asn Asn Ala Phe Ser Tyr Thr Ala Ser Gly Thr Ser Ala Gly Ser
340 345 350

Ser Thr Leu Arg Asp Val Val Pro Ser Pro Asp Thr Gln Glu Lys Gly
355 360 365

Ala Gln Glu Val Pro Phe Pro Lys Thr Glu Glu Val Glu Ser Ala Ile
370 375 380

Ser Asn Gly Val Thr Gly Gln Leu Asn Ile Val Gln Tyr Ile Lys Pro
385 390 395 400

Glu Pro Asp Gly Ala Phe Ser Ser Ser Cys Leu Gly Gly Asn Ser Lys
405 410 415

Ile Asn Ser Asp Ser Ser Phe Ser Val Pro Ile Lys Gln Glu Ser Thr
420 425 430

Lys His Ser Cys Ser Gly Thr Ser Phe Lys Gly Asn Pro Thr Val Asn
435 440 445

Pro Phe Pro Phe Met Asp Gly Ser Tyr Phe Ser Phe Met Asp Asp Lys
450 455 460

Asp Tyr Tyr Ser Leu Ser Gly Ile Leu Gly Pro Pro Val Pro Gly Phe
465 470 475 480

Asp Gly Asn Cys Glu Gly Ser Gly Phe Pro Val Gly Ile Lys Gln Glu
485 490 495

Pro Asp Asp Gly Ser Tyr Tyr Pro Glu Ala Ser Ile Pro Ser Ser Ala
500 505 510

Ile Val Gly Val Asn Ser Gly Gly Gln Ser Phe His Tyr Arg Ile Gly
515 520 525

Ala Gln Gly Thr Ile Ser Leu Ser Arg Ser Ala Arg Asp Gln Ser Phe
530 535 540

Gln His Leu Ser Ser Phe Pro Pro Val Asn Thr Leu Val Glu Ser Trp
545 550 555 560

Lys Ser His Gly Asp Leu Ser Ser Arg Arg Ser Asp Gly Tyr Pro Val
565 570 575



Leu Glu Tyr Ile Pro Glu Asn Val Ser Ser Ser Thr Leu Arg Ser Val
580 585 590

Ser Thr Gly Ser Ser Arg Pro Ser Lys Ile Cys Leu Val Cys Gly Asp
595 600 605

Glu Ala Ser Gly Cys His Tyr Gly Val Val Thr Cys Gly Ser Cys Lys
610 615 620

Val Phe Phe Lys Arg Ala Val Glu Gly Gln His Asn Tyr Leu Cys Ala
625 630 635 640

Gly Arg Asn Asp Cys Ile Ile Asp Lys Ile Arg Arg Lys Asn Cys Pro
645 650 655

Ala Cys Arg Leu Gln Lys Cys Leu Gln Ala Gly Met Asn Leu Gly Ala
660 665 670

Arg Lys Ser Lys Lys Leu Gly Lys Leu Lys Gly Ile His Glu Glu Gln
675 680 685

Pro Gln Gln Gln Gln Pro Pro Pro Pro Pro Pro Pro Pro Gln Ser Pro
690 695 700

Glu Glu Gly Thr Thr Tyr Ile Ala Pro Ala Lys Glu Pro Ser Val Asn
705 710 715 720

Thr Ala Leu Val Pro Gln Leu Ser Thr Ile Ser Arg Ala Leu Thr Pro
725 730 735

Ser Pro Val Met Val Leu Glu Asn Ile Glu Pro Glu Ile Val Tyr Ala
740 745 750

Gly Tyr Asp Ser Ser Lys Pro Asp Thr Ala Glu Asn Leu Leu Ser Thr
755 760 765

Leu Asn Arg Leu Ala Gly Lys Gln Met Ile Gln Val Val Lys Trp Ala
770 775 780

Lys Val Leu Pro Gly Phe Lys Asn Leu Pro Leu Glu Asp Gln Ile Thr
785 790 795 800

Leu Ile Gln Tyr Ser Trp Met Cys Leu Ser Ser Phe Ala Leu Ser Trp
805 810 815

Arg Ser Tyr Lys His Thr Asn Ser Gln Phe Leu Tyr Phe Ala Pro Asp
820 825 830

Leu Val Phe Asn Glu Leu Leu Ala Arg Val Arg Glu Gly
835 840 845

<210> 34
<211> 837
<212> PRT
<213> HUMAN

<400> 34

Met Glu Thr Lys Gly Tyr His Ser Leu Pro Glu Gly Leu Asp Met Glu
1 5 10 15

Arg Arg Trp Gly Gln Val Ser Gln Ala Val Glu Arg Ser Ser Leu Gly
20 25 30

Pro Thr Glu Arg Thr Asp Glu Asn Asn Tyr Met Glu Ile Val Asn Val
35 40 45

Ser Cys Val Ser Gly Ala Ile Pro Asn Asn Ser Thr Gln Gly Ser Ser
50 55 60

Lys Glu Lys Gln Glu Leu Leu Pro Cys Leu Gln Gln Asp Asn Asn Arg
65 70 75 80

Pro Gly Ile Leu Thr Ser Asp Ile Lys Thr Glu Leu Glu Ser Lys Glu
85 90 95

Leu Ser Ala Thr Val Ala Glu Ser Met Gly Leu Tyr Met Asp Ser Val
100 105 110

Arg Asp Ala Asp Tyr Ser Tyr Glu Gln Gln Asn Gln Gln Gly Ser Met
115 120 125

Ser Pro Ala Lys Ile Tyr Gln Asn Val Glu Gln Leu Val Lys Phe Tyr
130 135 140

Lys Gly Asn Gly His Arg Pro Ser Thr Leu Ser Cys Val Asn Thr Pro
145 150 155 160

Leu Arg Ser Phe Met Ser Asp Ser Gly Ser Ser Val Asn Gly Gly Val
165 170 175

Met Arg Ala Ile Val Lys Ser Pro Ile Met Cys His Glu Lys Ser Pro
180 185 190

Ser Val Cys Ser Pro Leu Asn Met Thr Ser Ser Val Cys Ser Pro Ala
195 200 205

Gly Ile Asn Ser Val Ser Ser Thr Thr Ala Ser Phe Gly Ser Phe Pro
210 215 220

Val His Ser Pro Ile Thr Gln Gly Thr Pro Leu Thr Cys Ser Pro Asn
225 230 235 240

Ala Glu Asn Arg Gly Ser Arg Ser His Ser Pro Ala His Ala Ser Asn
245 250 255

Val Gly Ser Pro Leu Ser Ser Pro Leu Ser Ser Met Lys Ser Ser Ile
260 265 270

Ser Ser Pro Pro Ser His Cys Ser Val Lys Ser Pro Val Ser Ser Pro
275 280 285

Asn Asn Val Thr Leu Arg Ser Ser Val Ser Ser Pro Ala Asn Ile Asn
290 295 300

Asn Ser Arg Cys Ser Val Ser Ser Pro Ser Asn Thr Asn Asn Arg Ser
305 310 315 320

Thr Leu Ser Ser Pro Ala Ala Ser Thr Val Gly Ser Ile Cys Ser Pro
325 330 335



Val Asn Asn Ala Phe Ser Tyr Thr Ala Ser Gly Thr Ser Ala Gly Ser
340 345 350

Ser Thr Leu Arg Asp Val Val Pro Ser Pro Asp Thr Gln Glu Lys Gly
355 360 365

Ala Gln Glu Val Pro Phe Pro Lys Thr Glu Glu Val Glu Ser Ala Ile
370 375 380

Ser Asn Gly Val Thr Gly Gln Leu Asn Ile Val Gln Tyr Ile Lys Pro
385 390 395 400

Glu Pro Asp Gly Ala Phe Ser Ser Ser Cys Leu Gly Gly Asn Ser Lys
405 410 415

Ile Asn Ser Asp Ser Ser Phe Ser Val Pro Ile Lys Gln Glu Ser Thr
420 425 430

Lys His Ser Cys Ser Gly Thr Ser Phe Lys Gly Asn Pro Thr Val Asn
435 440 445

Pro Phe Pro Phe Met Asp Gly Ser Tyr Phe Ser Phe Met Asp Asp Lys
450 455 460

Asp Tyr Tyr Ser Leu Ser Gly Ile Leu Gly Pro Pro Val Pro Gly Phe
465 470 475 480

Asp Gly Asn Cys Glu Gly Ser Gly Phe Pro Val Gly Ile Lys Gln Glu
485 490 495

Pro Asp Asp Gly Ser Tyr Tyr Pro Glu Ala Ser Ile Pro Ser Ser Ala
500 505 510

Ile Val Gly Val Asn Ser Gly Gly Gln Ser Phe His Tyr Arg Ile Gly
515 520 525

Ala Gln Gly Thr Ile Ser Leu Ser Arg Ser Ala Arg Asp Gln Ser Phe
530 535 540

Gln His Leu Ser Ser Phe Pro Pro Val Asn Thr Leu Val Glu Ser Trp
545 550 555 560

Lys Ser His Gly Asp Leu Ser Ser Arg Arg Ser Asp Gly Tyr Pro Val
565 570 575

Leu Glu Tyr Ile Pro Glu Asn Val Ser Ser Ser Thr Leu Arg Ser Val
580 585 590

Ser Thr Gly Ser Ser Arg Pro Ser Lys Ile Cys Leu Val Cys Gly Asp
595 600 605

Glu Ala Ser Gly Cys His Tyr Gly Val Val Thr Cys Gly Ser Cys Lys
610 615 620

Val Phe Phe Lys Arg Ala Val Glu Gly Gln His Asn Tyr Leu Cys Ala
625 630 635 640

Gly Arg Asn Asp Cys Ile Ile Asp Lys Ile Arg Arg Lys Asn Cys Pro
645 650 655

Ala Cys Arg Leu Gln Lys Cys Leu Gln Ala Gly Met Asn Leu Gly Ala
660 665 670

Arg Lys Ser Lys Lys Leu Gly Lys Leu Lys Gly Ile His Glu Glu Gln
675 680 685

Pro Gln Gln Gln Gln Pro Pro Pro Pro Pro Pro Pro Pro Gln Ser Pro
690 695 700

Glu Glu Gly Thr Thr Tyr Ile Ala Pro Ala Lys Glu Pro Ser Val Asn
705 710 715 720

Thr Ala Leu Val Pro Gln Leu Ser Thr Ile Ser Arg Ala Leu Thr Pro
725 730 735

Ser Pro Val Met Val Leu Glu Asn Ile Glu Pro Glu Ile Val Tyr Ala
740 745 750

Gly Tyr Asp Ser Ser Lys Pro Asp Thr Ala Glu Asn Leu Leu Ser Thr
755 760 765

Leu Asn Arg Leu Ala Gly Lys Gln Met Ile Gln Val Val Lys Trp Ala
770 775 780

Lys Val Leu Pro Gly Phe Lys Asn Leu Pro Leu Glu Asp Gln Ile Thr
785 790 795 800

Leu Ile Gln Tyr Ser Trp Met Cys Leu Ser Ser Phe Ala Leu Ser Trp
805 810 815

Arg Ser Tyr Lys His Thr Asn Ser Gln Phe Leu Tyr Phe Ala Pro Asp
820 825 830

Leu Val Phe Asn Glu
835